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TITLE 

A METHOD FOR ALTERING THE ISOFLAVONOID PROFILE 
IN THE PLANT PARTS OF AN ISOFLAVONOID-PRODUCING PLANT 
This application claims priority to U.S. Provisional Application No. 60/297,981, 
5 filed June 13, 2001 incorporated herein by reference in its entirety. 

This invention pertains to methods of altering the ratios of individual 
isoflavonoids in isoflavonoid-producing plants by using a C1 myb transcription factor 
and an R-type myc transcription factor that regulate expression of genes in the 
phenylpropanoid pathway. 

10 Isoflavonoids represent a class of secondary metabolites produced in legumes 

by a branch of the phenylpropanoid pathway and include such compounds as 
isoflavones, isoflavanones, rotenoids, pterocarpans, isoflavans, quinone derivatives, 
3-aryl-4-hydroxycoumarins, 3-arylcoumarins, isoflav-3-enes, coumestans, 
alpha-methyldeoxybenzoins, 2-arylbenzofurans, isoflavanol, coumaronochromone 

15 and the like. In plants, these compounds are known to be involved in interactions 
with other organisms and to participate in the defense responses of legumes against 
phytopathogenic microorganisms (Dewick, P. M. (1993) in The Flavonoids, 
Advances in Research Since 1986, Harbome, J. B. Ed., pp. 117-238, Chapman and 
Hall, London). Isoflavonoid-derived compounds also are involved in symbiotic 

20 relationships between roots and rhizobial bacteria which eventually result in 
nodulation and nitrogen-fixation (Phillips, D. A. (1992) in Recent Advances in 
Phytochemistry. Vol. 26, pp 201-231, Stafford, H. A. and Ibrahim, R. K., Eds, 
Plenum Press, New York), and overall they have been shown to act as antibiotics, 
repellents, attractants, and signal compounds (Barz, W. and Welle, R. (1992) 

25 Phenolic Metabolism in Plants, pp 139-164, Ed by H. A. Stafford and R. K. Ibrahim, 
Plenum Press, New York). 

Isoflavonoids have also been reported to have physiological activity in animal 
and human studies. For example, it has been reported that the isoflavones found in 
soybean seeds possess antihemolytic (Nairn, M., et al. (1976) J. Agric. Food Chem. 

30 24:1174-1177), antifungal (Nairn, M., et al. (1974) J. Agr. Food Chem. 22:806-810), 
estrogenic (Price, K. R. and Fenwick, G. R. (1985) Food Addit. Contam. 2:73-106), 
tumor-suppressing (Messina, M. and Barnes, S. (1991) J. Natl. Cancer Inst. 
83:541-546; Peterson, G., et al. (1991) Biochem. Biophys. Res. Commun. 
779:661-667), hypolipidemic (Mathur, K., etal. (1964) J. Nutr. 84:201-204), and 

35 serum cholesterol-lowering (Sharma, R. D. (1979) Lipids 74:535-540) effects. 
These studies indicate that isoflavones in soybean protein products may produce 
many significant health benefits. 
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Free isoflavones rarely accumulate to high levels in soybeans. Instead they 
are usually conjugated to carbohydrates or organic acids. Soybean seeds contain 
three types of isoflavones in three different forms: the aglycones, daidzein, genistein 
and glycitein; the glucosides, daidzin, genistin and glycitin; and the 
5 malonylglucosides, 6"-0-malonyIdaidzin, 6"-0-malonylgenistin and 6"-0- 
malonylglycitin. During processing acetylglucoside forms are produced: 6'-0- 
acetyldaidzin, 6'-0-acetyl genistin, and 6'-0-acetyI glycitin. The content of 
isoflavonoids in soybean seeds is quite variable and is affected by both genetics 
and environmental conditions such as growing location and temperature during 

10 seed fill (Tsukamoto, C, et al. (1995) J. Agric. Food Chem. 43:1 184-1 192; Wang, H. 
and Murphy, P. A. (1994) J. Agric. Food Chem. 42:1674-1677). In addition, 
isoflavonoid content in legumes can be stress-induced by pathogen attack, 
wounding, high UV light exposure and pollution (Dixon, R. A. and Paiva, N. L. 
(1995) Plant Cell 7:1085-1097). The genistein isoflavonoid forms make up the most 

15 abundant group in soybean seed and most food products, while daidzein and 
glycitein forms are present in lower levels (Murphy, P.A. (1999) J. Agric. Food 
Chem. 47:2697-2704). 

The biosynthetic pathway for isoflavonoids in soybean and their relationship 
with several other classes of phenylpropanoids is presented in Figure 1 A and 

20 Figure 1 B. 

Though the branch initiated by isoflavone synthase that leads to synthesis of 
isoflavonoids is mainly limited to the legumes, the phenylpropanoid pathway and 
other branches occur in other plant species. In maize, genes of the 
phenylpropanoid pathway and the lower anthocyanin branch are regulated by the 
25 transcription factor C1 in combination with an R-type factor. Together C1 and an R- 
type factor activate expression of a set of genes that leads to the synthesis and 
accumulation of anthocyanins in maize cells (Grotewold, E., et al. (1998) Plant Cell 
70:721-740). 

Maize C1 is a myb-type transcription factor that regulates expression of genes 
30 involved in anthocyanin production and accumulation in maize cells. However C1 
cannot activate gene expression alone, and requires interaction with an R-type myc 
transcription factor for activation of target gene promoters. The R-type factors 
include, among others, alleles of R, alleles of the homologous B gene of maize, and 
alleles of the homologous Lc gene. These genes function similarly and make up the 
35 R/B gene family (Goff, S.A., et al. (1992) Genes Dev. 6:864-875). The various 
genes of the R/B gene family may be in turn each found as diverging alleles that 
fluctuate in expression pattern within the corn plant due to differences in their 
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promoters. The members of this family encode proteins with very similar amino acid 
sequences and thus have comparable effects on the anthocyanin pathway structural 
genes. The specificity of the different promoters provides tissue specificity of 
anthocyanin biosynthesis (Radicella, J.P. et al. (1992) Genes Dev. 6:2152-2164; 
5 Walker, E.L. (1995) EMBO J. 14:2350-2363). The skilled artisan will recognize that 
the coding region of any functional gene of this large family could be used in 
conjunction with a promoter of choice to obtain R-gene function in the desired tissue 
or developmental stage. Examples of R/B family genes and alleles include, but are 
not limited to, Lc, R, R-S, R-P, Sn, B-Peru, and B-l. The coding regions of particular 

10 alleles of the Lc or B genes, especially the B-Peru allele, have been most commonly 
used in experiments in conjunction with C1. 

Cell suspension lines of the maize inbred Black Mexican Sweet (BMS) that 
harbored an estradiol-inducible version of a fusion of C1 and R (CRC) were 
analyzed after the addition of estradiol. The cDNA fragments from the known 

15 flavonoid genes, except chalcone isomerase, were induced in the CRC-expressing 
line after hormone induction (Bruce et al. (2000) Plant Cell 72:65-80). Maize C1 and 
an R-type factor together can promote the synthesis of anthocyanins in Arabidopsis 
tissues that do not naturally express anthocyanins (Lloyd, A.M., et al. (1992) 
Science 258:1773-1775), and in petunia leaves (Quattrocchio, F., et al. (1993) Plant 

20 Cell 5:1497-1512). 

WO 99/37794, published July 29, 1999, discloses the expression of maize C1 
and the Lc allele of R in tomato fruit which led to increased levels of the flavonol 
kaempferol. Thus, it is known that C1 and an R-type factor can regulate expression 
of individual genes of the phenylpropanoid pathway in plants including Arabidopsis, 

25 petunia, tomato, and maize leading to production of anthocyanins or flavonols. 
These are all plants that do not produce isoflavones. Isoflavone production is 
almost exclusively limited to the legumes. An example of one of the few non- 
legume plants that does produce isoflavones is sugar beet. 

C1 and B-Peru were transiently expressed in white clover and pea, which are 

30 legumes, (Majnik, et al. (1998) Aust. J. Plant Phys. 25:335-343) and anthocyanin 
levels assayed by visual inspection. Transient expression of C1 and B-Peru did 
result in production of anthocyanin in several tissues of white clover and pea. No 
assay was performed to determine any effect of C1 and B-Peru on isoflavonoid 
levels. Thus, any effects of C1 and an R-type myc on isoflavonoid levels in 

35 isoflavonoid-producing plants has not been taught. 

WO 00/44909, published August 3, 2000, discloses transformation of 
soybeans with maize C1 and R (as a CRC chimera) in conjunction with 



3 



WO 02/101023 



PCT/US02/21107 



overexpression of the isoflavone synthase gene. Any effects of CRC alone on 
levels of isoflavonoids have not been reported. Thus, it is not known whether 
introduction of C1 and an R-type factor alone, without isoflavone synthase, could 
have any effect on the synthesis and accumulation of isoflavonoids in 
5 isoflavonoid-producing plants. 

The physiological benefits associated with isoflavonoids in both plants and 
humans make the manipulation of their contents in crop plants highly desirable. For 
example, increasing levels of isoflavonoids in soybean seeds would increase the 
efficiency of extraction and lower the cost of isoflavone-related products sold today 

10 for use in either reduction of serum cholesterol or in estrogen replacement therapy. 
In addition to altering the levels of total isoflavonoids, altering the ratios of 
individual isoflavonoid components is of interest. There is some indication that 
genistein and daidzein have individual effects in plant disease response and on 
human health. While daidzein is the precursor to the major phytoalexins of 

15 soybean, the glyceollins, genistein is involved in establishing the cell response to 
pathogen attack so that glyceollins are synthesized (Graham and Graham (2000) 
Mol. Plant Microbe Interact 5:181-219). In human health, daidzein is effective in 
reducing levels of LDL-cholesterol and increasing the levels of HDL-cholesterol in 
human blood (US Patent No. 5,855,892). Daidzein is also effective for the treatment 

20 of hypertension and coronary atherosclerotic heart disease (Liu, Y., et al. (1990) 
Shenyang Yaoxueyuan Xuebao 7:123-125). Thus, raising the daidzein component 
in the total isoflavonoids could be valuable. 

Therefore there is a need to enhance the level of isoflavonoids and to alter the 
ratios of isoflavonoid components in isoflavonoid-producing plants. 

25 SUMMARY OF THE INVENTION 

This invention concerns a method of altering the isoflavonoid profile of an 
isoflavonoid-producing plant, said method comprising: 

(a) transforming a plant with (i) a first recombinant expression construct 
comprising a promoter operably linked to an isolated nucleic acid fragment encoding 

30 a C1 myb transcription factor and a second recombinant expression construct 

comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
an R myc-type transcription factor, (ii) a recombinant expression construct 
comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a promoter operably linked to an isolated nucleic 

35 acid fragment encoding an R myc-type transcription factor, or (iii) a recombinant 
expression construct comprising a promoter operably linked to an isolated nucleic 
acid fragment encoding all or part of a C1 myb transcription factor and all or part of 
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an R myc-type transcription factor wherein said construct is capable of functioning 
as both a C1 myb transcription factor and an R myc-type transcription factor; and 

(b) growing the transformed plant under conditions that are suitable for the 
expression of the recombinant expression construct or constructs; wherein 
5 expression of the construct or constructs alters the isoflavonoid profile of the 
transformed plant by increasing the total daidzein to total genistein ratio compared 
to the total daidzein to total genistein ratio of a control. 

In a second embodiment, the recombinant expression construct described 
above comprises a promoter operably linked to an isolated nucleic acid fragment 
10 encoding a chimeric transcription factor comprising a maize R myc-type coding 
region situated between the C1 DNA binding domain and the C1 activation domain. 

In a third embodiment, the isoflavonoid-producing plant is selected from the 
group consisting of soybean, clover, mung bean, lentil, hairy vetch, alfalfa, lupine, 
sugar beet, and snow pea. Also of interest are seed or plant parts of a plant 
15 transformed with a recombinant expression construct of the invention from which 
isoflavonoid-containing products can be obtained or extracted. 

In a fourth embodiment, this invention concerns a food or beverage 
incorporating these isoflavonoid-containing products. 

In a fifth embodiment, this invention concerns a method of producing an 
20 isoflavonoid-containing product which comprises: (a) cracking the seeds obtained 
from plants transformed with any of the recombinant expression constructs of the 
invention to remove the meats from the hulls; and (b) flaking the meats obtained in 
step (a) to obtain the desired flake thickness. 

In a sixth embodiment, this invention concerns an isoflavonoid-producing plant 
25 comprising in its genome 

(i) a first recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding a C1 myb transcription factor 
and a second recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding an R myc-type transcription 

30 factor, 

(ii) a recombinant expression construct comprising a promoter operably linked 
to an isolated nucleic acid fragment encoding a C1 myb transcription factor and a 
promoter operably linked to an isolated nucleic acid fragment encoding an R myc- 
type transcription factor, or 

35 (iii) a recombinant expression construct comprising a promoter operably linked 

to an isolated nucleic acid fragment encoding all or part of a C1 myb transcription 
factor and all or part of an R myc-type wherein said construct is capable of 
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functioning as both a C1 myb transcription factor and an R myc-type transcription 
factor; 

wherein said plant has an increased total daidzein to total genistein ratio when 
compared to the total daidzein to total genistein ratio of a control. 

5 

BIOLOGICAL DEPOSIT 
The following plasmid has been deposited under the terms of the Budapest 
Treaty with the American Type Culture Collection (ATCC), 10801 University 
Boulevard, Manassas, VA 201 10-2209, and bears the following designation, 
10 accession number and date of deposit. 

Plasmid Accession Number Date of Deposit 

pDP7951 PTA371 07/29/1999 

BRIEF DESCRIPTION OF THE 
DRAWINGS AND SEQUENCE LISTINGS 
15 The invention can be more fully understood from the following detailed 

description and the accompanying Sequence Listing which form a part of this 
application. 

Figure 1A and Figure 1B depict the soybean biosynthetic pathway for 
isoflavonoids and their relationship with several other classes of phenylpropanoids. 

20 Figure 1 A shows the pathway from phenylalanine to daidzein, genistein, and 
dihydroflavonol. Figure 1B shows the pathway from daidzein, genistein, and 
dihydroflavonol to glyceollins, kievitone, anthocyanins, and flavonols. 

Figure 2 depicts the total daidzein to total genistein ratios observed for 
individual R1 seeds from plants obtained from four independent transformation 

25 events showing novel total daidzein to total genistein ratios and from control seeds. 
The source of the seed for each group (i.e. CRC transformation event number or 
control) is indicated above the bars. Control seeds are obtained either from a plant 
which was subject to bombardment and not found to contain the nucleic acid 
fragment of interest or from plants transformed with a recombinant DNA expression 

30 construct that does not alter the isoflavonoid profile of the transformed plant. Seeds 
1,2, 5,6,7, 8, 9, 10, 11, 14, 15, 16, 17,21,22, 23,24, 25, 26, 27,28,29,31,35, 
36, 37, 39, 40, and 43 are from plants resulting from transformation experiments 
that, during PCR amplification, were negative for the CRC recombinant expression 
construct Seeds numbered 1 through 7 in Figure 2 of the provisional application 

35 correspond to those numbered 3, 20, 8, 41, 13, 30, and 38 in this figure. 
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Figure 3 depicts the total of isoflavone levels for individual R1 seeds obtained 
from plants from four independent transformation events showing novel total 
daidzein to total genistein ratios and from control seeds. The seeds in this figure 
are the same as those in Figure 2. The source of the seeds for each group (i.e. 
5 control or event number) is indicated above the bars. 

Figure 4 depicts the total daidzein to total genistein ratios observed for single 
R2 seed from field-grown transgenic plants derived from CRC recombinant 
expression construct lines and from wild type segregants (indicated by an asterisk 
[*] on the figure). The seed with novel total daidzein to total genistein ratios also 
10 showed a brown stripe along the median. The source (CRC transformation event 
number) of the seed for each group is indicated above the bars. 

Figure 5 depicts the total daidzein to total genistein ratios observed for single 
R2 seed from a plant derived from the 1-1 transformation event and grown in a 
growth room. Seed without a brown stripe along the median are indicated with a 
15 pound sign (#) above the bars while unmarked bars represent seed with a brown 
stripe along the median. 

Figure 6 depicts the total isoflavone levels observed for single R2 seed from 
field-grown transgenic plants derived from CRC recombinant expression construct 
lines and from wild type segregants (indicated by an asterisk [*] on the figure). The 
20 source of the seed for each group (CRC transformation event number) is indicated 
above the bars. 

Figure 7 depicts the total isoflavone levels in single R2 seed obtained from a 
plant grown in a growth room and derived from the 1-1 transformation event. Seed 
wirhout a brown stripe along the median are indicated with a pound sign (#) above 

25 the bars while the other, unmarked bars represent seed with a brown stripe. These 
represent the same individual seeds as in Figure 5. 

Figure 8 depicts the total daidzein to total genistein ratios of bulk-analyzed R3 
seed harvested from plants grown in a growth room. Each bulk seed sample is from 
a separate plant. The CRC recombinant expression construct line (i.e. CRC 

30 transformation event number) for each seed sample is indicated above the bars. 
Seed samples from wild type segregants derived from the CRC recombinant 
expression construct lines are indicated by an asterisk [*] above the bars. 

Figure 9 depicts the total isoflavone levels of bulk-analyzed R3 seed harvested 
from plants grown in a growth room. Each bulk seed sample is from a separate 

35 plant. The CRC recombinant expression construct line (i.e. CRC transformation 
event number) for each seed sample is indicated above the bars. Seed samples 
from wild type segregants derived from the CRC recombinant expression construct 
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lines are indicated by an asterisk [*] on the figure. The seed are the same as those 
analyzed for Figure 8. 

Figure 10 depicts the totals of individual isoflavones (daidzein, glycitein, and 
genistein) as well as the total isoflavones obtained from HPLC analyses of extracts 
5 prepared from individual R1 seeds obtained from plants transformed with the CRC 
recombinant DNA expression construct. Three to five seeds were analyzed from 
each plant. The control seeds are obtained from a transformant negative for the 
CRC recombinant DNA expression construct. Seeds obtained from plants positive 
for the CRC recombinant DNA expression construct are from individual 

10 transformation events 1-1 , 1-2, 1-25, and 1-35. The source of the seeds for each 
group (transformation event followed by plant number) is indicated above the bars. 

Figure 1 1 depicts the ratios of total daidzein to the total isoflavones obtained 
for the same R1 seeds transformed with the CRC recombinant DNA expression 
construct and analyzed in Figure 10. 

15 Figure 1 2 depicts the ratios of total genistein to the total isoflavones obtained 

for the same R1 seeds transformed with the CRC recombinant DNA expression 
construct and analyzed in Figure 10. 

Figure 13 depicts the ratios of total daidzein to the total isoflavones obtained 
for individual R2 seeds from plants grown in the growth room and derived from three 

20 different transformation events (1-1 , 1-2, and 1-25). The ratios obtained for six 

seeds from each plant are shown. The individual plants from which the seeds were 
harvested are identified with a number letter combination above the bars. The first 
two numbers designate the transformation event number, the third number 
designates the R0 plant, and the letter designates the R1 plant from which the R2 

25 seed were obtained. Seeds not having a brown stripe along the median are 
indicated with a pound sign (#) above the bar. 

Figure 14 depicts the ratios of total genistein to the sum of all isoflavones 
obtained for individual R2 seeds from plants grown in the growth room and derived 
from three different transformation events (1-1, 1-2, and 1-25). The ratios shown 

30 are for the same seed as shown in Figure 13. 

Figure 15 depicts the ratios of total daidzein to the total isoflavones obtained 
for individual R2 seeds from plants grown in the field and derived from three 
different transformation events (1-1 , 1-2, and 1-25). Each set of 2 seeds labeled 
with an asterisk (*) above the bar are tan seed from a segregant producing only tan 

35 seed, thereby identified as a wt segregant, of the transformation event of the 

adjacent plants. Three seeds all having a brown stripe were assayed from each of 2 
CRC recombinant DNA expression construct -containing plants from each of the 3 
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transformation events. The individual CRC recombinant DNA expression construct- 
containing plants from which the seed were harvested are identified with three 
numbers. The first two numbers designate the transformation event number and the 
third number designates the R0 plant from which the R2 seed were obtained. 
5 Figure 16 depicts the ratios of total genistein to the sum of all isoflavones 

obtained for individual R2 seeds from plants grown in the field and derived from 
three different transformation events (1-1 , 1-2, and 1-25). The ratios shown are for 
the same seeds as shown in Figure 15. 

Figure 17 depicts the total of isoflavones obtained for individual R2 seeds from 

10 plants grown in the field and derived from three different transformation events (1-1 , 
1-2, and 1-25). The seeds are the same as those analyzed for Figure 1 1 . 

Figure 18 depicts an LC-MS2 mass chromatogram of m/z 504.6 to 505.6 
obtained from extracts from a control wild type segregant seed without the CRC 
recombinant DNA expression construct. 

15 Figure 19 depicts an LC-MS2 mass chromatogram of m/z 504.6 to 505.6 

obtained from extracts from brown striped R3 seed derived from the 1-1 
transformation event. Additional peaks at 14.38, 15.46, 21.29, and 21.75 minutes 
are seen. 

The following sequence descriptions and Sequences Listing attached hereto 
20 comply with the rules governing nucleotide and/or amino acid sequence disclosures 
in patent applications as set forth in 37 C.F.R. §1.821-1.825. The Sequence Listing 
contains the one letter code for nucleotide sequence characters and the three letter 
codes for amino acids as defined in conformity with the IUPAC-IUB standards 
described in Nucleic Acids Research 1 3:3021 -3030 (1985) and in the Biochemical 
25 Journal 219 (No. 2):345-373 (1 984) which are herein incorporated by reference. 
The symbols and format used for nucleotide and amino acid sequence data comply 
with the rules set forth in 37 C.F.R. §1.822. 

SEQ ID NO:1 is the nucleotide sequence of primer 1 used for detection of the 
CRC recombinant DNA fragment. 
30 SEQ ID NO:2 is the nucleotide sequence of primer 2 used for detection of the 

CRC recombinant DNA fragment. 

SEQ ID NO:3 is the nucleotide sequence of primer 3 used for the detection of 
genomic and chimeric isoflavone synthase genes. 

SEQ ID NO:4 is the nucleotide sequence of primer 4 used for the detection of 
35 genomic and chimeric isoflavone synthase genes. 
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SEQ ID NO:5 is the nucleotide sequence of the cDNA insert in clone 
sdp3c.pk002.c22 encoding at least a portion of a soybean phenylalanine ammonia 
lyase. 

SEQ ID NO:6 is the nucleotide sequence of the cDNA insert in clone 
5 src3c.pk014.e17 encoding at least a portion of a soybean cinnamic acid 4- 
hydroxylase. 

SEQ ID NO:7 is the nucleotide sequence of the cDNA insert in clone 
ssm.pk0013.e3 encoding at least a portion of a soybean chalcone isomerase. 

SEQ ID NO:8 is the nucleotide sequence of the cDNA insert in clone 
10 src3c.pk009.e4 encoding at least a portion of a soybean chalcone reductase. 

SEQ ID NO:9 is the nucleotide sequence of the cDNA insert in clone pOY204 
encoding at least a portion of a soybean isoflavone synthase. 

SEQ ID NO: 10 is the nucleotide sequence of the cDNA insert in clone 
sfh .pk0040.g1 1 encoding at least a portion of a soybean flavanone 3-hydroxylase 
15 SEQ ID NO:11 is the nucleotide sequence of the cDNA insert in clone 

sfl1.pk131.g5 encoding a portion of a soybean dihydroflavonol reductase. 

SEQ ID NO:12 is the nucleotide sequence of the cDNA insert in clone 
src.pk0043.d1 1 encoding at least a portion of a soybean dihydroflavonol reductase. 

SEQ ID NO: 13 is the nucleotide sequence of the cDNA insert in clone 
20 ssl.pk0057.d12 encoding at least a portion of a soybean flavonol synthase. 

SEQ ID NO:14 is the nucleotide sequence of the cDNA insert in clone 
srr1c.pk001.k4 encoding at least a portion of a soybean isoflavone reductase. 

SEQ ID NO: 15 is the nucleotide sequence of primerS used for the preparation 
of an isoflavone synthase sequence by amplification from clone pOY204. 
25 SEQ ID NO: 16 is the nucleotide sequence of primer6 used for the preparation 

of an isoflavone synthase sequence by amplification from clone pOY204. 

DETAILED DESCRIPTION OF THE INVENTION 
All patents, patent applications and publications cited are incorporated herein 
30 by reference in their entirety. 

In the context of this disclosure, a number of terms shall be utilized. 
The term "isoflavonoid(s)"refers to a large group of polyphenolic compounds, 
based on a common diphenylpropane skeleton, which occur naturally in plants. 
This term, as used herein, includes, but is not limited to, the three types of 
35 isoflavones in three different forms: the aglycones, daidzein, genistein and glycitein; 
the glucosides, daidzin, genistin and glycitin; and the malonylglucosides, 6"-0- 
malonyldaidzin, 6"-0-malonylgenistin and 6"-0-malonylglycitin, as well as, the 
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acetylglucoside forms: 6'-0-acetyldaidzin, 6'-0-acetyl genistin, and 6'-0-acetyl 
glycitin that are formed during processing. 

As used herein, "total genistein" refers to the total amount of this isoflavonoid 
regardless of the form. Thus, "total genistein" includes the aglycone form, the 
5 glucoside form, the malonylglucoside form, and other genistein forms. Likewise, 
"total daidzein" refers to the total amount of this isoflavonoid regardless of the form. 
Thus, "total daidzein" includes the aglycone form, the glucoside form, the 
malonylglucoside form, and other daidzein forms, and "total glycitein" includes the 
aglycone form, the glucoside form, the malonylglucoside form, and other glycitein 
10 forms. 

The term "isoflavonoid-producing plant" refers to a plant in which isoflavonoids 
normally occur. 

The term "control" refers to a plant or plant parts, such as seed, which is/are 
used as the basis for comparison. The control plant or plant parts, such as seed, 

15 described herein are plants or plant parts in which the isoflavone profile has not 
been altered. Examples of suitable controls include, but are not limited to, a wild- 
type plant or plant parts obtained from a wild type plant; a plant which was subject to 
bombardment and not found to contain the nucleic acid fragment or fragments of 
interest or a plant part, such as a seed or seeds, obtained from such a transformed 

20 plant; a control plant or plant part can be one derived from a transformed plant that 
contains the nucleic acid fragment or fragments of interest, but it does not now 
contain the nucleic acid fragment or fragments of interest due to segregation of the 
fragments(s) during sexual reproduction (this can be referred to as a wild-type 
segregant); or a control plant can be a plant transformed with a nucleic acid 

25 fragment that does not alter the isoflavone profile, e.g., a plant transformed to 

produce seeds with a high lysine phenotype but the isoflavone profile would not be 
altered. For example, if the plant of interest is a soybean plant then the preferred 
control would be seeds obtained from one of the plants described above. If the 
plant of interest is clover, then the preferred control would be leaves obtained from 

30 one of the plants described above. Those skilled in the art will appreciate that a 
particular control will depend upon the plant of interest. 

The term "C1 myb transcription factor" refers to a protein encoded by a maize 
C1 gene and to any protein which is functionally equivalent to a C1 myb 
transcription factor. 

35 The term "R myc-type transcription factor" refers to a protein with a basic helix- 

loop-helix domain encoded by a member of the R/B gene family and to any protein 
that is functionally equivalent to an R myc-type transcription factor. 
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As used herein, an "isolated nucleic acid fragment" is a polymer of RNA or 
DNA that is single- or double-stranded, optionally containing synthetic, non-natural 
or altered nucleotide bases. An isolated nucleic acid fragment in the form of a 
polymer of DNA may be comprised of one or more segments of cDNA, genomic 
5 DNA or synthetic DNA. Nucleotides (usually found in their 5'-monophosphate form) 
are referred to by their single letter designation as follows: "A" for adenylate or 
deoxyadenylate (for RNA or DNA, respectively), "C" for cytidylate or deoxycytidylate, 
"G" for guanylate or deoxyguanylate, "U" for uridylate, "T" for deoxythymidylate, "R" 
for purines (A or G), "Y" for pyrimidines (C or T), "K" for G or T, "H" for A or C or T, 

10 "I" for inosine, and "N" for any nucleotide. 

The terms "subfragment that is functionally equivalent" and "functionally 
equivalent subfragment" are used interchangeably herein. These terms refer to a 
portion or subsequence of an isolated nucleic acid fragment in which the ability to 
alter gene expression or produce a certain phenotype is retained whether or not the 

15 fragment or subfragment encodes an active enzyme. For example, the fragment or 
subfragment can be used in the design of recombinant DNA fragments or chimeric 
genes to produce the desired phenotype in a transformed plant. 

The terms "homology", "homologous", "substantially similar" and 
"corresponding substantially" are used interchangeably herein. They refer to nucleic 

20 acid fragments wherein changes in one or more nucleotide bases do not affect the 
ability of the nucleic acid fragment to mediate gene expression or produce a certain 
phenotype. These terms also refer to modifications of the nucleic acid fragments of 
the instant invention such as deletion or insertion of one or more nucleotides that do 
not substantially alter the functional properties of the resulting nucleic acid fragment 

25 relative to the initial, unmodified fragment. It is therefore understood, as those 
skilled in the art will appreciate, that the invention encompasses more than the 
specific exemplary sequences. 

Moreover, the skilled artisan recognizes that substantially similar nucleic acid 
sequences encompassed by this invention are also defined by their ability to 

30 hybridize, under moderately stringent conditions (for example, 0.5 X SSC, 0.1 % 
SDS, 60°C) with the sequences exemplified herein, or to any portion of the 
nucleotide sequences disclosed herein and which are functionally equivalent to any 
of the nucleic acid sequences disclosed herein. Stringency conditions can be 
adjusted to screen for moderately similar fragments, such as homologous 

35 sequences from distantly related organisms, to highly similar fragments, such as 
genes that duplicate functional enzymes from closely related organisms. Post- 
hybridization washes determine stringency conditions. One set of preferred 
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conditions involves a series of washes starting with 6X SSC, 0.5% SDS at room 
temperature for 15 min, then repeated with 2X SSC, 0.5% SDS at 45°C for 30 min, 
and then repeated twice with 0.2X SSC, 0.5% SDS at 50°C for 30 min. A more 
preferred set of stringent conditions involves the use of higher temperatures in 
5 which the washes are identical to those above except for the temperature of the final 
two 30 min washes in 0.2X SSC, 0.5% SDS was increased to 60°C. Another 
preferred set of highly stringent conditions involves the use of two final washes in 
0.1X SSC, 0.1% SDS at 65°C. 

Sequence alignments and percent similarity calculations may be determined 

10 using a variety of comparison methods designed to detect homologous sequences 
including, but not limited to, the Megalign program of the LASARGENE 
bioinformatics computing suite (DNASTAR Inc., Madison, Wl). Multiple alignment of 
the sequences are performed using the Clustal method of alignment (Higgins and 
Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, 

15 GAP LENGTH PENALTY=10). Default parameters for pairwise alignments and 
calculation of percent identity of protein sequences using the Clustal method are 
KTUPLE=1 , GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. For 
nucleic acids these parameters are KTUPLE=2, GAP PENALTY=5, WINDOW=4 
and DIAGONALS SAVED=4. 

20 "Gene" refers to a nucleic acid fragment that expresses a specific protein, 

including regulatory sequences preceding (5' non-coding sequences) and following 
(3' non-coding sequences) the coding sequence. "Native gene" refers to a gene as 
found in nature with its own regulatory sequences. "Chimeric gene" refers any gene 
that is not a native gene, comprising regulatory and coding sequences that are not 

25 found together in nature. Accordingly, a chimeric gene may comprise regulatory 
sequences and coding sequences that are derived from different sources, or 
regulatory sequences and coding sequences derived from the same source, but 
arranged in a manner different than that found in nature. A "foreign gene" refers to 
a gene not normally found in the host organism, but that is introduced into the host 

30 organism by gene transfer. Foreign genes can comprise native genes inserted into 
a non-native organism, or chimeric genes. A "transgene" is a gene that has been 
introduced into the genome by a transformation procedure. An "allele" is one of 
several alternative forms of a gene occupying a given locus on a chromosome. 
When all the alleles present at a given locus on a chromosome are the same that 

35 plant is homozygous at that locus. If the alleles present at a given locus on a 
chromosome differ that plant is heterozygous at that locus. 
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"Coding sequence" refers to a DNA sequence that codes for a specific amino 
acid sequence. "Regulatory sequences" refer to nucleotide sequences located 
upstream (5' non-coding sequences), within, or downstream (3' non-coding 
sequences) of a coding sequence, and which influence the transcription, RNA 
5 processing or stability, or translation of the associated coding sequence. Regulatory 
sequences may include, but are not limited to, promoters, translation leader 
sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a DNA sequence capable of controlling the expression of 
a coding sequence or functional RNA. The promoter sequence consists of proximal 

10 and more distal upstream elements, the latter elements often referred to as 

enhancers. Accordingly, an "enhancer" is a DNA sequence which can stimulate 
promoter activity and may be an innate element of the promoter or a heterologous 
element inserted to enhance the level or tissue-specificity of a promoter. Promoters 
may be derived in their entirety from a native gene, or be composed of different 

15 elements derived from different promoters found in nature, or even comprise 

synthetic DNA segments. It is understood by those skilled in the art that different 
promoters may direct the expression of a gene in different tissues or cell types, or at 
different stages of development, or in response to different environmental 
conditions. Promoters which cause a gene to be expressed in most cell types at 

20 most times are commonly referred to as "constitutive promoters". New promoters of 
various types useful in plant cells are constantly being discovered; numerous 
examples may be found in the compilation by Okamuro, J. K., and Goldberg, R. B. 
(1989) Biochemistry of Plants 75:1-82. It is further recognized that since in most 
cases the exact boundaries of regulatory sequences have not been completely 

25 defined, DNA fragments of some variation may have identical promoter activity. 

The "translation leader sequence" refers to a polynucleotide sequence located 
between the promoter sequence of a gene and the coding sequence. The 
translation leader sequence is present in the fully processed mRNA upstream of the 
translation start sequence. The translation leader sequence may affect processing 

30 of the primary transcript to mRNA, mRNA stability or translation efficiency. 

Examples of translation leader sequences have been described (Turner, R. and 
Foster, G. D. (1995) Mol. Biotech. 3:225-236). 

The "3' non-coding sequences" refer to DNA sequences located downstream 
of a coding sequence and include polyadenylation recognition sequences and other 

35 sequences encoding regulatory signals capable of affecting mRNA processing or 
gene expression. The polyadenylation signal is usually characterized by affecting 
the addition of polyadenylic acid tracts to the 3" end of the mRNA precursor. The 
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use of different 3' non-coding sequences is exemplified by Ingelbrecht, I. L, et al. 
(1989) Plant Cell 1:671-680. 

"RNA transcript" refers to the product resulting from RNA polymerase- 
5 catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect 
complementary copy of the DNA sequence, it is referred to as the primary transcript 
or it may be an RNA sequence derived from post-transcriptional processing of the 
primary transcript and is referred to as the mature RNA. "Messenger RNA (mRNA)" 
refers to the RNA that is without introns and that can be translated into protein by 

10 the cell. "cDNA" refers to a DNA that is complementary to and synthesized from a 
mRNA template using the enzyme reverse transcriptase. The cDNA can be single- 
stranded or converted into the double-stranded form using the Klenow fragment of 
DNA polymerase I. "Sense" RNA refers to RNA transcript that includes the mRNA 
and can be translated into protein within a cell or in vitro. "Antisense RNA" refers to 

15 an RNA transcript that is complementary to all or part of a target primary transcript 
or mRNA and that blocks the expression of a target gene (U.S. Patent 
No. 5,107,065). The complementarity of an antisense RNA may be with any part of 
the specific gene transcript, i.e., at the 5' non-coding sequence, 3' non-coding 
sequence, introns, or the coding sequence. "Functional RNA" refers to antisense 

20 RNA, ribozyme RNA, or other RNA that may not be translated but yet has an effect 
on cellular processes. The terms "complement" and "reverse complement" are used 
interchangeably herein with respect to mRNA transcripts, and are meant to define 
the antisense RNA of the message. 

The term "operably linked" refers to the association of nucleic acid sequences 

25 on a single nucleic acid fragment so that the function of one is regulated by the 

other. For example, a promoter is operably linked with a coding sequence when it is 
capable of regulating the expression of that coding sequence (i.e., that the coding 
sequence is under the transcriptional control of the promoter). Coding sequences 
can be operably linked to regulatory sequences in a sense or antisense orientation. 

30 In another example, the complementary RNA regions of the invention can be 

operably linked, either directly or indirectly, 5' to the target mRNA, or 3' to the target 
mRNA, or within the target mRNA, or a first complementary region is 5' and its 
complement is 3' to the target mRNA. 

The term "expression", as used herein, refers to the production of a functional 

35 end-product, e.g., an mRNA or a protein (precursor or mature). 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one 
from which any pre- or propeptides present in the primary translation product have 
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been removed. "Precursor" protein refers to the primary product of translation of 
mRNA; i.e., with pre- and propeptides still present. Pre- and propeptides may be 
but are not limited to intracellular localization signals. 

"Stable transformation" refers to the transfer of a nucleic acid fragment into a 
5 genome of a host organism, including both nuclear and organellar genomes, 
resulting in genetically stable inheritance. In contrast, "transient transformation" 
refers to the transfer of a nucleic acid fragment into the nucleus, or DNA-containing 
organelle, of a host organism resulting in gene expression without integration or 
stable inheritance. Host organisms containing the transformed nucleic acid 

10 fragments are referred to as "transgenic" organisms. The preferred method of cell 
transformation of rice, corn and other monocots is the use of particle-accelerated or 
"gene gun" transformation technology (Klein et al., (1987) Nature (London) 
327:70-73; U.S. Patent No. 4,945,050), or an Agrobacterium-mediaied method 
using an appropriate Ti plasmid containing the transgene (Ishida Y. et al., 1996, 

15 Nature Biotech. 74:745-750). 

Standard recombinant DNA and molecular cloning techniques used herein are 
well known in the art and are described more fully in Sambrook, J., Fritsch, E.F. and 
Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor 
Laboratory Press: Cold Spring Harbor, 1989 (hereinafter "Sambrook"). 

20 The term "recombinant" refers to an artificial combination of two otherwise 

separated segments of sequence, e.g., by chemical synthesis or by the 
manipulation of isolated segments of nucleic acids by genetic engineering 
techniques. 

"PCR" or "Polymerase Chain Reaction" is a technique for the synthesis of large 
25 quantities of specific DNA segments, consists of a series of repetitive cycles (Perkin 
Elmer Cetus Instruments, Norwalk, CT). Typically, the double stranded DNA is heat 
denatured, the two primers complementary to the 3' boundaries of the target 
segment are annealed at low temperature and then extended at an intermediate 
temperature. One set of these three consecutive steps is referred to as a cycle. 
30 A "recombinant DNA fragment" refers to an artificial combination of nucleic 

acid fragments that are not found together in nature, e.g. coding sequences and 
non-regulatory sequences. Thus, the difference between a "recombinant DNA 
fragment" and a "recombinant construct" as defined below turns on the presence or 
absence of regulatory sequences in the artificial combination of nucleic acid 
35 sequences. If a regulatory sequence is part of the combination then it is a 

"recombinant construct". If there are no regulatory sequences in the combination, 
then it is a "recombinant DNA fragment" . 
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The terms "recombinant construct", "expression construct", "chimeric 
construct", "construct" and "recombinant expression construct" are used 
interchangeably herein. A recombinant construct comprises an artificial combination 
of nucleic acid fragments, e.g., regulatory and coding sequences that are not found 
5 together in nature. For example, a chimeric construct may comprise at least one 
regulatory sequence and at least one coding sequence that are derived from 
different sources, or regulatory sequences and coding sequences derived from the 
same source, but arranged in a manner different than that found in nature. Such 
construct may be used by itself or may be used in conjunction with a vector. If a 

10 vector is used then the choice of vector is dependent upon the method that will be 
used to transform host cells as is well known to those skilled in the art. For 
example, a plasmid vector can be used. The skilled artisan is well aware of the 
genetic elements that must be present on the vector in order to successfully 
transform, select and propagate host cells comprising any of the isolated nucleic 

15 acid fragments of the invention. The skilled artisan will also recognize that different 
independent transformation events will result in different levels and patterns of 
expression (Jones et al., (1985) EMBO J. 4:2411-2418; De Almeida et al., (1989) 
Mol. Gen. Genetics 218:78-86), and thus that multiple events must be screened in 
order to obtain lines displaying the desired expression level and pattern. Such 

20 screening may be accomplished by Southern analysis of DNA, Northern analysis of 
mRNA expression, immunoblotting analysis of protein expression, or phenotypic 
analysis, among others. 

The present invention concerns a method of altering the isoflavonoid profile of 
an isoflavonoid-producing plant, said method comprising: 

25 (a) transforming a plant with (i) a first recombinant expression construct 

comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a second recombinant expression construct 
comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
an R myc-type transcription factor, (ii) a recombinant expression construct 

30 comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a promoter operably linked to an isolated nucleic 
acid fragment encoding an R myc-type transcription factor, or (iii) a recombinant 
expression construct comprising a promoter operably linked to an isolated nucleic 
acid fragment encoding all or part of a C1 myb transcription factor and all or part of 

35 an R myc-type transcription factor wherein said construct is capable of functioning 
as both a C1 myb transcription factor and an R myc-type transcription factor; and 
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(b) growing the transformed plant under conditions that are suitable for the 
expression of the recombinant expression construct or constructs; wherein 
expression of the construct or constructs alters the isoflavonoid profile of the 
transformed plant by increasing the total daidzein to total genistein ratio compared 
5 to the total daidzein to total genistein ratio of a control. 

Also of interest are isoflavonoid-producing plants comprising in their genome 
(i) a first recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding a C1 myb transcription factor 
and a second recombinant expression construct comprising a promoter operably 

10 linked to an isolated nucleic acid fragment encoding an R myc-type transcription 
factor, (ii) a recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding a C1 myb transcription factor 
and a promoter operably linked to an isolated nucleic acid fragment encoding an R 
myc-type transcription factor, or (iii) a recombinant expression construct comprising 

15 a promoter operably linked to an isolated nucleic acid fragment encoding all or part 
of a C1 myb transcription factor and all or part of an R myc-type wherein said 
construct is capable of functioning as both a C1 myb transcription factor and an R 
myc-type transcription factor; wherein said plant has an increased total daidzein to 
total genistein ratio when compared to the total daidzein to total genistein ratio of a 

20 control. 

Examples of isoflavonoid-producing plants include, but are not limited to, 
soybean, clover, mung bean, lentil, hairy vetch, alfalfa, lupine, sugar beet, and snow 
pea. In a more preferred embodiment, the preferred isoflavonoid-producing plant 
would be soybean. Examples of other isoflavonoid-producing plants can be found in 
25 WO 93/23069, published November 25, 1 993, the disclosure of which is hereby 
incorporated by reference. 

Transformation methods are well known to those skilled in the art and are 
described above. 

The recombinant expression constructs which can be used to transform an 
30 isoflavonoid-producing plant fall into one of three categories: 

(1) the constructs can be entirely separate, e.g., one construct may comprise a 
promoter operably linked to an isolated nucleic acid fragment encoding a C1 myb 
transcription factor and another separate construct may comprise a promoter 
operably linked to an isolated nucleic acid fragment encoding an R-myc type 

35 transcription factor; 

(2) a single construct comprising a promoter operably linked to an isolated 
nucleic acid fragment encoding a C1 myb transcription factor and a promoter 
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operably linked to an isolated nucleic acid fragment encoding an R-myc type 
transcription factor; or 

(3) a single construct comprising a promoter operably linked to an isolated 
nucleic acid fragment encoding all or a part of a C1 myb transcription factor and an 
5 isolated nucleic acid fragment encoding all or a part of an R-myc type transcription 
factor such that a fusion protein combining the two encoded proteins is produced. 

The transformed plant is then grown under conditions suitable for the 
expression of the recombinant expression construct or constructs. Expression of 
the recombinant expression construct or constructs alters the isoflavonoid profile of 

10 the transformed plant or plant part by increasing the total daidzein to total genistein 
ratio compared to the total daidzein to total genistein ratio of an untransformed plant 
or plant part. For example, in some cases it may be preferrable to examine 
expression of a recombinant expression construct by comparing seeds obtained 
from a transformed plant with seeds obtained from an untransformed plant to 

15 determine if there has been an increase in the total daidzein to total genistein ratio. 
In a more preferred, embodiment, an isoflavonoid-producing plant can be 
transformed with a recombinant expression construct comprising a promoter 
operably linked to an isolated nucleic acid fragment encoding a chimeric 
transcription factor comprising the maize R coding region situated between the C1 

20 DNA binding domain and the C1 activation domain. 

The regeneration, development and cultivation of plants from single plant 
protoplast transformants or from various transformed explants is well known in the 
art (Weissbach and Weissbach, In.: Methods for Plant Molecular Biology, (Eds.), 
Academic Press, Inc., San Diego, CA (1988)). This regeneration and growth 

25 process typically includes the steps of selection of transformed cells, culturing those 
individualized cells through the usual stages of embryonic development through the 
rooted plantlet stage. Transgenic embryos and seeds are similarly regenerated. 
The resulting transgenic rooted shoots are thereafter planted in an appropriate plant 
growth medium such as soil. 

30 The development or regeneration of plants containing the foreign, exogenous 

gene that encodes a protein of interest is well known in the art. Preferably, the 
regenerated plants are self-pollinated to provide homozygous transgenic plants. 
Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown 
plants of agronomically important lines. Conversely, pollen from plants of these 

35 important lines is used to pollinate regenerated plants. A transgenic plant of the 
present invention containing a desired polypeptide is cultivated using methods well 
known to one skilled in the art. 
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There are a variety of methods for the regeneration of plants from plant tissue. 
The particular method of regeneration will depend on the starting plant tissue and 
the particular plant species to be regenerated. 

Methods for transforming dicots, primarily by use of Agrobacterium 
5 tumefaciens, and obtaining transgenic plants have been published for cotton (U.S. 
Patent No. 5,004,863, U.S. Patent No. 5,159,135, U.S. Patent No. 5,518, 908); 
soybean (U.S. Patent No. 5,569,834, U.S. Patent No. 5,416,011, McCabe et. al., 
BiolTechnology 6:923 (1988), Christou et al., Plant Physiol. 87:671-674 (1988)); 
Brassica (U.S. Patent No. 5,463,174); peanut (Cheng et al., Plant Cell Rep. 

10 15:653-657 (1996), McKently et al„ Plant Cell Rep. 14:699-703 (1995)); papaya; 
and pea (Grant et al., Plant Cell Rep. 15:254-258, (1995)). 

Assays for gene expression based on the transient expression of cloned 
nucleic acid constructs have been developed by introducing the nucleic acid 
molecules into plant cells by polyethylene glycol treatment, electroporation, or 

15 particle bombardment (Marcotte et al., Nature 335:454-457 (1988); Marcotte et al., 
Plant Cell 1 :523-532 (1989); McCarty et al., Cell 66:895-905 (1991); Hattori et al., 
Genes Dev. 6:609-618 (1992); Goff et al., EMBO J. 9:2517-2522 (1990)). 

Transient expression systems may be used to functionally dissect gene 
constructs (see generally, Maliga et al., Methods in Plant Molecular Biology, Cold 

20 Spring Harbor Press (1 995)). It is understood that any of the nucleic acid molecules 
of the present invention can be introduced into a plant cell in a permanent or 
transient manner in combination with other genetic elements such as vectors, 
promoters, enhancers etc. 

In addition to the above discussed procedures, practitioners are familiar with 

25 the standard resource materials which describe specific conditions and procedures 
for the construction, manipulation and isolation of macromolecules (e.g., DNA 
molecules, plasmids, etc.), generation of recombinant DNA fragments and 
recombinant expression constructs and the screening and isolating of clones, (see 
for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 

30 Harbor Press (1989); Maliga et al., Methods in Plant Molecular Biology, Cold Spring 
Harbor Press (1995); Birren et al., Genome Analysis: Detecting Genes, 1, Cold 
Spring Harbor, New York (1998); Birren et al., Genome Analysis: Analyzing DNA, 2, 
Cold Spring Harbor, New York (1998); Plant Molecular Biology: A Laboratory 
Manual, eds. Clark, Springer, New York (1997)). 

35 Any promoter can be used in the method of the invention. Thus, the origin of 

the promoter chosen to drive expression of the coding sequence is not critical as 
along as it has sufficient transcriptional activity to accomplish the invention by 
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expressing translatable mRNA for the desired protein genes in the desired host 
tissue. In a preferred embodiment, the promoter is a seed-specific promoter. 
Examples of a seed-specific promoter include, but are not limited to, the promoter 
for p-conglycinin (Chen et al. (1989) Dev. Genet. 10: 1 12-122), the napin and 
5 phaseolin promoters. A plethora of promoters are described in WO 00/18963, 
published on April 6, 2000, the disclosure of which is hereby incorporated by 
reference. 

Also within the scope of this invention are seeds or plant parts obtained from 
such transformed plants. Plant parts include differentiated and undifferentiated 
10 tissues, including but not limited to, roots, stems, shoots, leaves, pollen, seeds, 

tumor tissue, and various forms of cells and culture such as single cells, protoplasts, 
embryos, and callus tissue. The plant tissue may be in plant or in organ, tissue or 
cell culture. 

In another aspect, this invention concerns an isoflavonoid-containing product 

15 high in total daidzein and low in total genistein obtained from the seeds or plant 
parts obtained from the transformed plants described herein. Examples of such an 
isoflavonoid-containing product include, but are not limited to, protein isolate, protein 
concentrate, meal, grits, full fat and defatted flours, textured proteins, textured 
flours, textured concentrates and textured isolates. In still another aspect, this 

20 invention concerns an isoflavonoid-containing product high in total daidzein and low 
in total genistein extracted from the seeds or plant parts obtained from the 
transformed plants described herein. An extracted product could then used in the 
production of pills, tablets, capsules or other similar dosage forms made to contain a 
high concentration of isoflavones. 

25 Methods for obtaining such products are well known to those skilled in the art. 

For example, in the case of soybean, such products can be obtained in a variety of 
ways. Conditions typically used to prepare soy protein isolates have been 
described by [Cho, et al, (1981) U.S. Patent No. 4,278,597; Goodnight, et al. (1978) 
U.S. Patent No. 4,072,670]. Soy protein concentrates are produced by three basic 

30 processes: acid leaching (at about pH 4.5), extraction with alcohol (about 55-80%), 
and denaturing the protein with moist heat prior to extraction with water. Conditions 
typically used to prepare soy protein concentrates have been described by Pass 
[(1975) U.S. Patent No. 3,897,574] and Campbell et al. [(1985) in New Protein 
Foods, ed. by Altschul and Wilcke, Academic Press, Vol. 5, Chapter 10, Seed 

35 Storage Proteins, pp 302-338]. 

"Isoflavone-containing protein products" can be defined as those items 
produced from seed of a suitable plant which are used in feeds, foods and/or 
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beverages. For example, "soy protein products"can include, but are not limited to, 
those items listed in Table 1 . "Soy protein products". 

TABLE 1 

Soy Protein Products Derived from Soybean Seeds 3 
Whole Soybean Products Processed Soy Protein Products 

Roasted Soybeans Full Fat and Defatted Flours 

Baked Soybeans Soy Grits 

Soy Sprouts Soy Hypocotyls 

Soy Milk Soybean Meal 

Soy Milk 

Specialty Soy Foods/Ingredients Soy Protein Isolates 

Soy Milk Soy Protein Concentrates 

Tofu Textured Soy Proteins 

Tempeh Textured Flours and Concentrates 

Miso Textured Concentrates 

Soy Sauce Textured Isolates 

Hydrolyzed Vegetable Protein 

Whipping Protein 

a See Soy Protein Products: Characteristics, Nutritional Aspects and Utilization 
(1987). Soy Protein Council 

"Processing" refers to any physical and chemical methods used to obtain the 
products listed in Table 1 and includes, but is not limited to, heat conditioning, 
flaking and grinding, extrusion, solvent extraction, or aqueous soaking and 
extraction of whole or partial seeds. Furthermore, "processing" includes the 
methods used to concentrate and isolate soy protein from whole or partial seeds, as 
well as the various traditional Oriental methods in preparing fermented soy food 
products. Trading Standards and Specifications have been established for many of 
these products (see National Oilseed Processors Association Yearbook and Trading 
Rules 1991-1992). Products referred to as being "high protein" or "low protein" are 
those as described by these Standard Specifications. "NSI" refers to the Nitrogen 
Solubility Index as defined by the American Oil Chemists' Society Method Ac4 41 . 
"KOH Nitrogen Solubility" is an indicator of soybean meal quality and refers to the 
amount of nitrogen soluble in 0.036 M KOH under the conditions as described by 
Araba and Dale [(1990) Poult. Sci. 69:76-83]. "White" flakes refer to flaked, 
dehulled cotyledons that have been defatted and treated with controlled moist heat 
to have an NSI of about 85 to 90. This term can also refer to a flour with a similar 
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NSI that has been ground to pass through a No. 100 U.S. Standard Screen size. 
"Cooked" refers to a soy protein product, typically a flour, with an NSI of about 20 to 
60. "Toasted" refers to a soy protein product, typically a flour, with an NSI below 20. 
"Grits" refer to defatted, dehulled cotyledons having a U.S. Standard screen size of 
5 between No. 10 and 80. "Soy Protein Concentrates" refer to those products 

produced from dehulled, defatted soybeans by three basic processes: acid leaching 
(at about pH 4.5), extraction with alcohol (about 55-80%), and denaturing the protein 
with moist heat prior to extraction with water. Conditions typically used to prepare 
soy protein concentrates have been described by Pass [(1975) U.S. Patent No. 

10 3,897,574; Campbell et al., (1985) in New Protein Foods, ed. by Altschul and 

Wilcke, Academic Press, Vol. 5, Chapter 10, Seed Storage Proteins, pp 302-338]. 
"Extrusion" refers to processes whereby material (grits, flour or concentrate) is 
passed through a jacketed auger using high pressures and temperatures as a 
means of altering the texture of the material. "Texturing" and "structuring" refer to 

15 extrusion processes used to modify the physical characteristics of the material. The 
characteristics of these processes, including thermoplastic extrusion, have been 
described previously [Atkinson (1970) U.S. Patent No. 3,488,770, Horan (1985) In 
New Protein Foods, ed. by Altschul and Wilcke, Academic Press, Vol. 1A, 
Chapter 8, pp 367-414]. Moreover, conditions used during extrusion processing of 

20 complex foodstuff mixtures that include soy protein products have been described 
previously [Rokey (1983) Feed Manufacturing Technology III, 222-237; McCulloch, 
U.S. Patent No. 4,454,804]. 

Also, within the scope of this invention are food and beverages which have 
incorporated therein an isoflavonoid-containing product of the invention. 

25 The beverage can be a liquid or in a dry powdered form. 

The foods to which the isoflavonoid-containing product of the invention can be 
incorporated/added include almost all foods/beverages. For example, there can be 
mentioned meats such as ground meats, emulsified meats, marinated meats, and 
meats injected with an isoflavonoid-containing product of the invention; nutritional 

30 supplements; beverages such as nutritional beverages, sports beverages, protein 
fortified beverages, juices, milk, milk alternatives, and weight loss beverages; 
cheeses such as hard and soft cheeses, cream cheese, and cottage cheese; frozen 
desserts such as ice cream, ice milk, low fat frozen desserts, and non-dairy frozen 
desserts; yogurts; soups; puddings; bakery products; and salad dressings; and dips 

35 and spreads such as mayonnaise; and chip dips; and food bars. The 

isoflavonoid-containing product can be added in an amount selected to deliver a 
desired dose to the consumer of the food and/or beverage. 
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In still another aspect this invention concerns a method of producing an 
isoflavonoid-containing product which comprises: (a) cracking the seeds obtained 
from transformed plants of the invention to remove the meats from the hulls; and 
(b) flaking the meats obtained in step (a) to obtain the desired flake thickness. 

5 

EXAMPLES 

The present invention is further defined in the following Examples, in which 
parts and percentages are by weight and degrees are Celsius, unless otherwise 
stated. It should be understood that these Examples, while indicating preferred 

10 embodiments of the invention, are given by way of illustration only. From the above 
discussion and these Examples, one skilled in the art can ascertain the essential 
characteristics of this invention, and without departing from the spirit and scope 
thereof, can make various changes and modifications of the invention to adapt it to 
various usages and conditions. Thus, various modifications of the invention in 

15 addition to those shown and described herein will be apparent to those skilled in the 
art from the foregoing description. Such modifications are also intended to fall 
within the scope of the appended claims. 

EXAMPLE 1 

Construction of Plasmids for Transformation of Glycine max 
20 The effect on the isoflavonoid profile of soybean of a protein encoded by a 

recombinant DNA fragment containing maize nucleotide sequences encoding C1 
and the Lc allele of R was tested. For this purpose, plasmid pOY203 was 
constructed for introduction of a CRC recombinant expression construct into 
soybean embryos. Plasmid pOY203 was briefly described in PCT publication 
25 WO 00/44090 (published August 3, 2000) and contains a CRC recombinant DNA 
fragment under the control of the phaseolin promoter and termination signals in a 
vector containing expression systems which allow for selection for growth in the 
presence of hygromycin in both bacterial and plant systems. 

Plasmid pOY203 was prepared through an intermediary plasmid pOY135. 
30 Plasmid pOY135 contains, flanked by Hind II II restriction endonuclease sites, the 
CRC recombinant DNA fragment inserted between the phaseolin promoter and 
polyadenylation signal sequences. The CRC recombinant DNA fragment contains, 
between Sma I sites and in the 5' to 3' orientation, maize nucleotide sequences 
encoding 

35 (a) the C1 myb domain to amino acid 125; 

(b) the entire coding region of the Lc allele of R (amino acids 1 through 160); 

and 
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(c) the C1 transcription activation domain (from amino acid 126 to the 
C-terminus of C1). 

The CRC recombinant DNA fragment was isolated from plasmid pDP7951 
(described in PCT Publication WO 00/44090, published August 3, 2000, and bearing 
5 ATCC deposit No. PTA371 ) and inserted into vector pCW1 08N. Vector pCW1 08N 
is derived from the commercially-available vector pUC18 (Gibco-BRL) and contains 
between Hind III sites: 

(a) a DNA fragment of the phaseolin gene promoter extending from -410 to 
+77 relative to the transcription start site (Slightom et al. (1991 ) Plant Mol. Biol. Man. 

10 B16:1);and 

(b) a 1 175 bp DNA fragment including the polyadenylation signal sequence 
region of the same phaseolin gene (see sequence descriptions in Doyle et al. (1986) 
J. Biol. Chem. 2619228-9238 and Slightom et al. (1983) Proc. Natl. Acad. Sci. USA 
80:1897-1901). 

15 Plasmid pCW1 08N was digested with Asp 71 8, which cuts between the 

phaseolin promoter and polyadenylation signal sequence, and the protruding ends 
filled-in by incubation with T4 DNA polymerase in the presence of dATP, dCTP, 
dGTP, and dTTP. The DNA fragment containing the CRC recombinant DNA 
fragment was isolated from pDP7951 by digestion with Sma I, purified by agarose 

20 gel electrophoresis, and inserted into the blunt-ended pCW108N to create pOY135. 
To create pOY203, a cassette containing the phaseolin promoter/CRC 
recombinant DNA fragment/phaseolin polyadenylation signal sequence (herein 
referred to as CRC recombinant expression construct) was liberated from pOY135 
by digestion with Hind III and introduced into Hind Ill-digested pZBL102. Plasmid 

25 pZBL102 contains expression systems which allow for selection for growth in the 
presence of hygromycin to be used as a means of identifying cells that contain 
plasmid DNA sequences in both bacterial and plant systems and is described in 
PCT Publication WO 00/44090. 

Even though it is not necessary for the practice of the invention, in the original 

30 experiment, plasmid pOY203 was co-bombarded into soybean embryos with 
plasmid pWSJ001 also described in PCT publication WO 00/44090. Plasmid 
pWSJ001 contains the isoflavone synthase coding region under the control of the 
alpha' beta-conglycinin promoter and phaseolin polyadenylation signal sequence in 
a vector containing expression systems which allow for selection for growth in the 

35 presence of hygromycin in both bacterial and plant systems. The isoflavone 
synthase coding region (found in NCBI General Identifier No. 6979520) was 
obtained by PCR amplification of a clone (sgs1c.pk006.o20) obtained from a 
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soybean cDNA library prepared from seeds germinated for 4 hours. Amplification 
was performed using Pfu polymerase (Stratagene) in a standard PGR reaction in a 
GeneAmp PCR System with primer5 (shown in SEQ ID NO:15) and primer6 (shown 
in SEQIDNO:16). 

5 5'-TTG CTG G AACTTG C ACTTG GT-3' [SEQIDNO:15] 

5'-GTATATGATGGGTACCTTAATTAAGAAAGGAG-3' [SEQ ID NO:16] 
The isoflavone synthase coding region was first inserted between the alpha' 
beta-conglycinin promoter and phaseolin polyadenylation signal sequence of vector 
pCW109. Vector pCW109 contains a 550 bp fragment of the alpha' 

10 beta-conglycinin promoter (Slightom et al. (1 991 ) Plant Mol. Biol. Man. B1 6:1 ) and 
the same phaseolin polyadenylation signal sequence described above for 
pCW108N. The Nco I site located between the promoter and polyadenylation signal 
sequence fragments in plasmid pCW109 was eliminated by digestion with Nco I 
followed by fill-in with T4 DNA polymerase in the presence of dATP; dCTP, dGTP 

15 and dTTP. The resulting DNA was digested with Kpn I, which cuts 3' of the filled-in 
Nco I site, and the isoflavone synthase fragment introduced. The cassette 
containing the IFS chimeric gene (alpha' beta-conglycinin promoter/isoflavone 
synthase/phaseolin 3' polyadenylation sequence) was liberated from this plasmid by 
digestion with Hind III and introduced into Hind Ill-digested pZBL102 to form 

20 pWSJOOL 

EXAMPLE 2 

Transformation of Somatic Soybean Embryo Cultures 
and Regeneration of Soybean Plants 
The ability to alter the isoflavone levels in transgenic soybean plants 
25 expressing the CRC recombinant expression construct was tested by transforming 
soybean somatic embryo cultures with plasmids pOY203 and pWSJ001, screening 
for transformants expressing only the CRC recombinant expression construct, 
allowing plants to regenerate, and measuring the levels of isoflavones produced. 
The present invention does not require the presence of plasmid pWSJ001 . 
30 Screening for the presence of the transgenes was performed by PCR amplification, 
and plants containing the isoflavone synthase recombinant-expression construct 
were excluded from this work. 

Soybean embryogenic suspension cultures were transformed with pOY203 in 
conjunction with pWSJ001 by the method of particle gun bombardment, and 
35 transformants carrying the CRC recombinant expression construct in pOY203, and 
not the IFS recombinant expression construct in pWSJOOl , were identified. 
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The following stock solutions and media were used for transformation and 
regeneration of soybean plants: 
Stock Solutions (per Liter): 

MS Sulfate 100x stock: 37.0 g MgS04.7H 2 0, 1 .69 g MnS0 4 .H 2 0, 0.86 g 
5 ZnS04.7H20, 0.0025 g CUSO4.5H2O. 

MS Halides 100x stock: 44.0 g CaCI 2 .2H 2 C\ 0.083 g Kl, 0.00125 g 
CoCI 2 .6H 2 0, 17.0 g KH 2 P0 4> 0.62 g H 3 B0 3i 0.025 g Na 2 MoO4.2H 2 0, 3.724 g 
Na 2 EDTA, 2.784 g FeS0 4 .7H 2 0. 

B5 Vitamin stock: 100.0 g myo-inositol, 1.0 g nicotinic acid, 1.0 g pyridoxine 
10 HCI, 10.0 g thiamine. 

2,4-D stock: 10 mg/mL 
Media (per Liter): 

SB55: 10 mL of each MS stock, 1 mL of B5 Vitamin stock, 0.8 g NH 4 N03 j 
3.033 g KN03_ 1 mL 2,4-D stock, 0.667 g asparagine, pH 5.7. 
15 SB103: 1 pk. Murashige & Skoog salt mixture (Gibco BRL), 60 g maltose, 2 g 

gelrite, pH 5.7. 

SB71-1: B5 salts, 1mL B5 vitamin stock, 30 g sucrose, 750 mg MgCI2, 2 g 
gelrite, pH 5.7. 

Soybean (of the Jack variety) embryogenic suspension cultures were 
20 maintained in 35 mL SB55 liquid media on a rotary shaker (1 50 rpm) at 28°C with a 

mix of fluorescent and incandescent lights providing a 16 hour day, 8 hour night 

cycle. Cultures were subcultured every 2 to 3 weeks by inoculating approximately 

35 mg of tissue into 35 mL of fresh liquid media. 

Soybean embryonic suspension cultures were transformed by the method of 
25 particle gun bombardment (see Klein et al. (1987) Nature 327:70-73) using a 

DuPont Biolistic PDS1000/He instrument. Embryos were co-bombarded with 

plasmid pOY203 (containing the CRC recombinant expression construct) and 

plasmid pWSJ001 (containing the IFS recombinant expression construct). 

Transformants containing the CRC recombinant expression construct alone were 
30 identified by PCR and are described herein. Transformants containing the IFS 

recombinant expression construct were used for other purposes and do not form 

part of the present invention. 

For bombardment, 5 of a 1:2 mixture of pOY203 (0.5 \igl\iL) and pWSJ001 

(1 |ag/|aL) plasmid DNA, 50 uL CaCI 2 (2.5 M), and 20 uL spermidine (0.1 M) were 
35 added to 50 jaL of a 60 mg/mL 0.6 |_im gold particle suspension. The particle 

preparation was agitated for 3 minutes, spun in a microfuge for 10 seconds and the 
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supernatant removed. The DNA-coated gold particles were then washed once with 
400 nL of 100% ethanol, resuspended in 40 jiL of anhydrous ethanol, and sonicated 
three times for 1 second each. Five u.L of the DNA-coated gold particles was then 
loaded on each macro carrier disk. 
5 Approximately 300 to 400 mg of two-week-old suspension culture was placed 

in an empty 60 mm X 15 mm petri dish and the residual liquid removed from the 
tissue using a pipette. The tissue was placed about 3.5 inches away from the 
retaining screen and bombarded twice. Membrane rupture pressure was set at 
1 1 00 psi and the chamber was evacuated to -28 inches of Hg. Two plates were 

10 bombarded for each experiment and, following bombardment, the tissue was 
divided in half, placed back into liquid media, and cultured as described above. 

Eleven days after bombardment, the liquid media was exchanged with fresh 
SB55 media containing 50 mg/mL hygromycin. The selective media was refreshed 
weekly. Seven weeks post bombardment, green, transformed tissue was observed 

15 growing from untransformed, necrotic embryogenic clusters. Isolated green tissue 
was removed and inoculated into individual flasks to generate new, clonally 
propagated, transformed embryogenic suspension cultures. Thus, each new line 
was treated as an independent transformation event. Soybean suspension cultures 
can be maintained as suspensions of embryos clustered in an immature 

20 developmental stage through subculture or can be regenerated into whole plants by 
maturation and germination of individual somatic embryos. 

Transformed embryogenic clusters were removed from liquid culture and 
placed on SB103 solid agar media containing no hormones or antibiotics. Embryos 
were cultured for eight weeks at 26°C with mixed fluorescent and incandescent 

25 lights on a 16 hour day, 8 hour night schedule. During this period, individual 

embryos were removed from the clusters and analyzed at various stages of embryo 
development. Selected lines were assayed by PCR amplification for the presence 
of the CRC recombinant expression construct and/or the IFS recombinant 
expression construct. 

30 5'- AGGCGGAAGAACTGCTGCAACG -3' [SEQ ID NO:1] 

5'- AGGTCCATTTCGTCGCAGAGGC -3' [SEQ ID NO:2] 

5-ATGTTTGGCAAGTAGGAAGGGACC -3' [SEQ ID NO:3] 

5'-GCATTCCATAAGCCGTCACGATTC -3' [SEQ ID NO:4] 

The presence of the CRC recombinant expression construct was determined 
35 using primerl and primer2 (shown in SEQ ID NO:1 and SEQ ID NO:2, respectively) 
which produce a fragment that is not present in wild type soybean embryos. The 
presence of the IFS recombinant expression construct was determined using 
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primer3 and primer4 (shown in SEQ ID NO:3 and SEQ ID NO:4, respectively). 
Separation, on an agarose gel, of the amplification products obtained with this pair 
of primers yielded a 1062 bp fragment indicative of the endogenous IFS gene (i.e., 
containing introns) in all samples and an 845 bp fragment in the embryos also 
5 containing the IFS recombinant expression construct. Embryos containing the CRC 
recombinant expression construct and not the IFS recombinant expression construct 
were selected for further study. 

Somatic embryos became suitable for germination after eight weeks and were 
then removed from the maturation medium and dried in empty petri dishes for 1 to 
10 5 days. The dried embryos were then planted in SB71-1 medium where they were 
allowed to germinate under the same lighting and germination conditions described 
above. Germinated embryos were transferred to sterile soil and grown to maturity. 
Seeds were harvested. 

EXAMPLE 3 

15 Analysis of Isoflavones in R1 Seed of Transformants 

Containing the CRC Recombinant expression construct 
Isoflavone levels were analyzed in seed from soybean primary transformants 
(R1 seed) containing the CRC recombinant expression construct and not the IFS 
recombinant expression construct. Extracts were prepared and analyzed by HPLC 

20 as follows. Each seed was weighed and placed in a 2 ml_ screw cap tube 

containing a %" cylindrical bead and 20 mg flavone (as internal standard). The seed 
was then crushed using a bead beater at 4200 rpm for 30 second intervals until 
reduced to a fine powder. The sample was homogenized into solution by the 
addition of 800 jjLof 80% aqueous methanol and further bead beating. Each 

25 sample was left in a shaking water bath at 60°C for 4 hours and then centrifuged at 
12000 rpm for 10 minutes. A 100 pL aliquot of the supernatant was removed and 
added to 100 pL deionized water, vortexed, centrifuged, and analyzed by HPLC. An 
HP 1 100 instrument equipped with a diode array detector and a Phenomenex, Luna 
3 C18(2), 4.6 mm x 150 mm column was used for HPLC analysis. The column 

30 temperature was 22°C, the solvent flow rate was 1 mL/min, and the detection was 
performed at 260 nm. The solvent elution consisted of a gradient from 5% 
methanol/ 95% 0.1% trifluoroacetic acid (TFA) in water to 100% methanol over 
16 minutes followed by a 3 minute post-run wash. This resulted in chromatograms 
depicting daidzein, glycitein, genistein, and their conjugate derivatives. Standard 

35 curves were constructed with each analysis and individual compounds were 
measured. All of the conjugates were converted to aglycone equivalent values 
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using standard conversion factors. In addition to total concentrations for each 
aglycone, the total isoflavone content was also calculated. 

Generally, five individual seeds from each of one to three plants from each 
transformation event were analyzed. Seeds from primary transformant plants from 
5 a total of 1 3 transformation events, each carrying only the CRC recombinant 
expression construct, were analyzed as well as seed from primary transformant 
plants not carrying a transgene. A subset of the CRC events showed an altered 
isoflavonoid composition as compared to the controls. Observing a phenotype in a 
portion of transgenic plants is explained by the usual variation of expression of a 

10 transgene that occurs in independent transformation events. 

The isoflavone component profile for seeds from a control plant is shown in 
Figure 10, seeds #1-5. This control plant came from a transformation experiment 
but it was PCR negative for the CRC recombinant expression construct. In this 
typical control profile, genistein is the most abundant of the isoflavones. Daidzein is 

15 generally the next highest level component with glycitein lowest, although in some 
seeds the daidzein and glycitein levels can be similar. This example also shows a 
substantial amount of variation in the levels of the individual isoflavones, as well as 
the sum of all isoflavone levels, among individual seeds from the same plant. An 
obvious change in the isoflavone component profile could be seen in seeds 

20 obtained from plants representing four independent transformation events (Figure 
10). The R1 seeds from the hemizygous primary transformants would be expected 
to be segregating for the transgene. Among the seeds analyzed from the 1-1, 1-2, 
and 1-35 event plants there are seeds with an altered profile as well as seeds with 
the control profile described above. All of the seeds from the 1-25 event had an 

25 altered profile indicating that all five seeds contain the transgene. This could be due 
to the presence of multiple segregating loci or due to the selection by chance of five 
single seeds, each containing a single locus. 

The glycitein levels were the least affected in these seeds with altered 
isoflavone components. However in some seeds, particularly of the 1-1 event, the 

30 glycitein levels were increased about two-fold above the level in wild type segregant 
seeds of the same transformant (Figure 10, seeds # 6, 10, 13). The pathway for 
glycitein synthesis is not defined, but may be a part of the daidzein branch due to 
more similarity in glycitein structure with daidzein than with genistein. An enzyme 
encoded by the CYP71 D9 P450 that may be involved in glycitein synthesis was 

35 recently characterized (Latunde-Dada et al. (2001 ) J. Biol. Chem. 276: 1 688-1 695). 
If daidzein and glycitein are closely related, the CRC transgene has the effect of 
activating the daidzein/glycitein branch of isoflavone synthesis. In some individual 
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seeds from CRC transformants having high daidzein levels the total isoflavone 
levels were increased. Out of the 16 seeds with altered isoflavone profiles (shown 
in Figure 10), 14 seeds had higher total isoflavone levels than the seeds from the 
control plant. It may be concluded that the total isoflavone level in individual seeds 
5 is quite variable, but CRC can, in some cases, raise the level further. The 

inconsistency of this effect suggests that there must be other factors that contribute 
to establishing the final total isoflavone levels. 

The altered isoflavone profile of seeds in these four events is distinguished by 
greatly increased levels of total daidzein, the highest level being raised about 

10 four-fold when compared to the daidzein levels in control and wild type segregating 
seeds (seeds #10 and 17). The same individual seeds with high levels of daidzein 
also had greatly decreased levels of genistein, in some instances decreased to 
almost undetectable levels (seeds # 6, 1 1 , 13, for example). These changes result 
in the daidzein component contributing 60% to 80% of the total isoflavones in the 

15 altered phenotype seeds, while daidzein is generally 20% to 35% of the total in 

control and wild type segregating seeds (Figure 11). In the altered phenotype seeds 
the genistein component ranges from a low of almost 0% up to 14%, while in control 
and wild type segregating seeds the range is between 43% and 60% of the total 
isoflavones (Figure 12). The reduction in genistein level varied between the 

20 different transformation events, with event 1-2 having the greatest genistein 

reduction, almost to zero. In event 1-25 genistein was only reduced to between 6% 
and 14% of total isoflavones, this is still much below control levels. 

Figure 2 shows the total daidzein to total genistein ratios for individual seeds 
obtained from plants from the 1-1 , 1-2, 1-25, and 1-35 transformation events having 

25 an increased total daidzein to total genistein ratio as well as from control seeds. 
The control seeds are obtained either from plants resulting from transformation 
experiments that, during PCR amplification, were negative for the CRC recombinant 
expression construct, or from plants transformed with a recombinant DNA 
expression construct that does not alter the isoflavonoid profile. The ratios for two 

30 seeds obtained from the 1-2 event are not shown because their ratios are too high 
(784.0 and 801 .0) to plot on the same chart. While the total daidzein to total 
genistein ratios for control seeds ranged between 0.3 and 1.6, the ratios for the 
seeds from the four transformation events with the novel high total daidzein 
phenotype ranged between 4.7 and 801.0. The exact ratio of total daidzein to total 

35 genistein was variable between individual seeds, even within a single transformation 
event. However, it is clear that expression of the CRC recombinant expression 
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construct in soybean seeds altered the total daidzein to total genistein ratio from 
being less than 2, to being over 4.5. 

Most of the seeds from the four transformation events having increased total 
daidzein to total genistein ratios also had increased levels of total isoflavones. 
5 Figure 3 depicts a graph showing the total isoflavone levels for the same seeds as 
the total daidzein to total genistein ratios are shown for in Figure 2. Of 24 seeds 
analyzed that showed high total daidzein to total genistein ratios, 18 had total 
isoflavone levels higher than the highest total isoflavone values for the controls. 
The control seeds had total isoflavonee levels ranging from 199 to 1833 (Ltg/g seed 

10 weight. Eighteen of the seeds showing increased total daidzein to total genistein 
ratios had total isoflavone levels between 2003 and 4737, while the remaining six 
seeds showing increased total daidzein to total genistein ratios had total isoflavone 
levels between 348 and 1808. Thus, expression of the CRC recombinant 
expression construct in soybean seeds produced higher levels of total isoflavones in 

15 a majority of the seeds having high total daidzein to total genistein ratios. 

EXAMPLE 4 

Analysis of Isoflavones in R2 Seed of Transformants 
Containing the CRC Recombinant expression construct 
R1 seeds from the 1-1 , 1-2, 1-25 and 1-35 events described above were 

20 planted in the field at the Stine location in Newark, DE and R1 seeds from the 1-1 
event were also planted in pots and grown in a growth room. Seeds were harvested 
(R2 seed) and analyzed for isoflavone levels. Single seed extracts were prepared 
and analyzed as described in Example 3 with the following modifications. No 
internal standard was added. Samples were extracted in 80% methanol for 1 hour 

25 at 27°C. After centrifugation, 500 jxL of supernatant was transferred to a fresh 2 ml_ 
tube. An additional 500 jaL of 80% methanol was added to the ground seed left in 
the tube, the mix was resuspended for 30 seconds using a Spex 2000 Geno-grinder 
at 1620 strokes/min, and the centrifugation repeated. Another 500 of 
supernatant was combined with the 500 |j,L in the fresh tube, the sample vortexed, 

30 centrifuged again, and 300 \iL added to 300 yL of deionized H2O and vortexed. 
The sample was assayed by HPLC under the conditions of Example 3 except that 
the column temperature was 25°C, and the detection was at 262 nm. The data was 
calculated as described in Example 3. 

It was noted that individual seeds with the high total daidzein to total genistein 

35 ratio also had a brown stripe along the median of the seed. These seeds had a dark 
brown stripe around the median on the side opposite to the hilum, parallel to the 
cotyledon axis, as opposed to the overall light tan of seeds having a control 
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phenotype. Some of the brown striped seeds were smaller than control seeds and 
some were slightly wrinkled. Cutting the seeds showed that the brown pigmentation 
was only on the external coat and did not extend into the cotyledons. 

To further investigate a possible correlation between the visual phenotype and 
5 isoflavone profile, plants were grown in the growth chamber from either tan or brown 
striped R1 seeds from the 1-1 , 1-2, and 1-25 events and the harvested R2 seeds 
observed. All plants grown from tan seeds produced only tan seeds. The 1-25 
plant grown from a brown stripe seed produced 17 brown striped seeds, consistent 
with there being multiple loci in this line. The 1-1 and 1-2 plants grown from brown 

10 striped seeds all produced segregating brown striped and tan seeds. For the 1-2 
line the segregation was 3:1 brown striped to tan, indicating a dominant trait. For 
the 1-1 line, the segregation ratio was 2:1, suggesting either lower penetrance of the 
trait or a possible association with a recessive seed-lethal phenotype. Isoflavone 
levels were analyzed in individual brown striped and tan seeds from the 1-1 and 1-2 

15 events, as well as brown stripe seeds from the 1-25 event. In every case, the brown 
striped seeds had high daidzein and low genistein, while the tan seeds had the high 
genistein control profile (Figure 13 and Figure 14, respectively). Thus the brown 
stripe cosegregates with an altered isoflavone phenotype in seeds obtained from 
CRC transformants. This visual phenotype provided a means of identifying CRC 

20 homozygotes as well as wild type segregants. 

Thus, seeds with the high total daidzein to total genistein trait could be 
identified visually before analysis. Plants that were wild type segregants from the 
CRC transformation event lines were identified as those plants producing only 
seeds without the brown stripe and the controls for the field-grown R2 seeds were 

25 obtained from these plants. 

The total daidzein to total genistein ratio for single R2 seeds from field-grown 
plants: either plants with no seeds with a brown stripe (wild type segregants) or 
plants with seeds segregating for the brown stripe are shown in Figure 4. The 
transgenic plants expressing the CRC recombinant DNA construct were segregating 

30 for the phenotype but only data for seeds with a brown stripe are shown. The total 
daidzein to total genistein ratios in the wild type segregants were between 0.6 and 
0.7 while the total daidzein to total genistein ratios in seeds having a brown stripe 
along the median of the seed ranged between 2.9 and 128.0. Of the 18 seeds 
having a brown stripe along the median that were analyzed, 16 had total daidzein to 

35 total genistein ratios equal to or greater than 20, while the other two seed had ratios 
of 2.9 and 4.5. Clearly, the high total daidzein to total genistein ratio was inherited 
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in second generation plants of the 1-1, 1-2, and 1-25 events as demonstrated by the 
isoflavone component levels in the R2 field-grown seeds. 

The control seeds for plants grown in the growth room was seed lacking the 
brown stripe along the median and harvested from the same plant as the seed 
5 having the brown stripe along the median of the seed. The total daidzein to total 
genistein ratios for the R2 seeds from growth room plants was obtained and is 
shown in Figure 5. For the R2 seeds from plants of the 1-1 event grown in the 
growth-room the total daidzein to total genistein ratios were much higher than the 
ratios for control seeds. The control seeds had total daidzein to total genistein ratios 

10 between 0.5 and 0.6 while seeds with the brown stripe down the median had total 
daidzein to total genistein ratios between 1 3.6 and 64.4. 

The total isoflavone levels in the R2 seeds from field grown plants were 
measured and are summarized in Figure 6. While some seeds containing the 
brown stripe along the median had total isoflavone levels about two times that of 

15 seeds not having the brown stripe, some of this brown stripe seeds had lower total 
isoflavone levels than the wild type segregant seeds without the brown stripe. Of 
the seeds from plants resulting from the 1-1 transformation event, all of the seeds 
with the brown stripe along the median had total isoflavone levels greater than the 
wild type segregant seeds. Of the seeds from plants resulting from the 1-25 

20 transformation event, the total isoflavone levels of the seeds with the brown stripe 
along the median were greater than the wild type segregant for all but one of the 
seeds analyzed. Of the seeds from plants resulting from the 1-2 transformation 
event, the total isoflavone levels for one of the wild type segregant seeds was higher 
than the usual control range. The total isoflavone level for this seed was higher than 

25 the total isoflavone levels for all of the seed from the 1-2 transformation event 

having the brown stripe along the median. However, all but one of the seeds from 
the 1-2 transformation event and having the brown stripe along the median had total 
isoflavone levels greater than the rest of the wild type segregant seeds (for the 1-1 , 
1-25, and 1-2 events). 

30 The total isoflavone levels in the R2 seeds from plants of the 1-1 

transformation event grown in the growth-room are shown in Figure 7. Seeds 
having the brown stripe along the median had higher total isoflavonw levels than the 
control seeds. 

As shown in Figure 15, the field grown brown striped R2 seeds from all three 
35 events had high daidzein levels, and in general had much reduced genistein, with 
levels around 2%. Even the 1-25 event, which had the least reduced genistein in 
the R1 seed, showed a greater genistein reduction in field grown seeds (Figure 16). 
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Thus variations in the extent of genistein reduction occurred between generations 
and environments. Two individual seeds, one from the 1-1 event and one from the 
1-25 event, are notable in having about 15%-17% genistein. This shows that there 
are also variables that affect the genistein levels even in individual seeds from the 
5 same plant. However, overall R2 seeds having the CRC transgene continued to 
show increased daidzein levels as well as the reduced genistein levels. Also the 
total isoflavone level was increased in some seeds, but again not consistently 
(Figure 17). 

In summary, the total isoflavone levels of second generation seeds were 
10 higher in most instances for seeds having a brown stripe along the median 

(indicative of the higher total daidzein to total genistein ratio and of the presence of 
the CRC recombinant expression construct) than control wild type segregant seeds. 
EXAMPLE 5 

Analysis of Isoflavones in R3 Seed of Transformants 

15 Containing the CRC Recombinant Expression Construct 

Plants were grown in the growth room from R2 seeds harvested from growth 
room grown plants from the 1-1, 1-2, and 1-25 transformation events and seeds 
were harvested (R3) and analyzed for isoflavone content. Extracts were prepared 
and analyzed in bulk samples as follows. Eight seeds from each plant were 

20 combined and ground in a non-commercial grinder. A 200 mg sample was weighed 
and transferred to a 2 mL vial. The sample was then prepared and assayed as 
described in Example 4. The controls for this experiment were R3 seeds from wild 
type segregants producing only non-brown striped seeds. For each transformation 
event one sample was analyzed from each of one control plant and three plants 

25 containing the CRC recombinant expression construct and the results are shown in 
Figure 8. The total daidzein to total genistein ratios in the wild type segregant bulk 
seed samples ranged between 0.7 and 0.8. The total daidzein to total genistein 
ratios in the samples from plants having the CRC recombinant expression construct 
ranged between 5.3 and 71 .8. Clearly, the high total daidzein to total genistein ratio 

30 was inherited in third generation plants of the 1-1 , 1-2, and 1-25 events as 
demonstrated by the isoflavone component levels in the R3 seeds. 

The total isoflavone levels in the bulk R3 samples are shown in Figure 9. In 
the R3 seeds, all bulk seed samples from the plants having the CRC recombinant 
expression construct had total isoflavone levels greater than all of the control plant 

35 seed samples. 
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EXAMPLE 6 

Analysis of the Expression of Genes of the Phenylpropanoid Pathway 
in R4 Seeds of Transformants Containing the 
CRC Recombinant Expression Construct 
5 Northern blot and immunobiot analyses were performed to determine the 

genes in the phenylpropanoid pathway affected by the expression of the CRC 
recombinant expression construct. Probes were prepared to detect mRNA from 
phenylalanine ammonia lyase (PAL), cinnamic acid 4-hydroxylase (C4H), chalcone 
isomerase (CHI), chalcone reductase (CHR), isoflavone synthase (IFS), flavanone 

10 3-hydroxylase (F3H), dihydroflavonol reductase (DFR), flavonol synthase (FS), and 
isoflavone reductase (IFR). RNA was prepared from seed of R3 plants containing 
the CRC recombinant expression construct or from controls, transferred to a 
membrane and hybridized with the probes mentioned above. These Northern blot 
analyses indicated that the levels of PAL, C4H, CHI, CHR, F3H, DFR, and FS were 

15 increased in the seed of transgenic plants expressing the CRC recombinant 

expression construct compared to controls. Immunobiot analyses were performed 
on protein samples derived from seed of the same plants, using anti-CHS, 
anti-CHR, or anti-IFS antisera. The protein expression profiles of CHR and IFS 
genes correlated with their RNA expression profiles. The CHS protein was 

20 increased in seed of CRC transgenic plants, suggesting higher expression of the 
CHS gene. 

Northern Blot analyses 

R3 generation plants of the 1-1 event were grown in the growth chamber. 
Plants homozygous for the CRC recombinant expression construct, producing only 

25 brown-striped seeds, and wild type segregants, producing only tan seeds, were 
grown. Immature seeds were harvested at two stages of development, at 
approximately 10-days-after flowering and 20-days-after flowering weighing 
approximately 150 mg and 250 mg, respectively. Total RNA and protein from these 
materials were extracted separately. For RNA extraction, a modified Trizol method 

30 (Gibco BRL, Life Technologies, Rockville, MD) was applied. Approximately 5 seeds 
for each sample were ground together in liquid nitrogen and 500 mg of the powder 
were extracted with 7.5 mL of Trizol reagent for 5 min. Three mL of chloroform was 
added, mixed, and the 4-ml aqueous phase was collected. The RNA was 
precipitated by the addition of 4 mL of iso-amyl alcohol. After centrifugation and 

35 removal of the liquid phase, the RNA precipitate was washed with 75% ethanol and 
air-dried for 20 min. The RNA was resuspended in 400 pL of water and from each 
sample, an amount equivalent to 30 jj,g of RNA was loaded in each lane of a precast 
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Rilant RNA Gel (FMC, Rockland, ME). The RNA components were separated by 
electrophoresis and transferred to a membrane following standard protocols for 
RNA separation and Northern blotting (Sambrook). 

Probes were prepared from clones identified in the DuPont EST proprietary 
5 database as encoding the desired genes. The sequence of the entire cDNA insert 
in each chosen clone (except srr1c.pk001.k4) was obtained to verify that the insert 
represented the correct gene. The clones used to prepare the probes are shown in 
Table 2 together with the name of the encoded polypeptide and the corresponding 
identifier (SEQ ID NO:) as used in the attached Sequence Listing. 
10 TABLE 2 

Clones Used in the Preparation of Probes for the 
Detection of RNA from Genes of the Phenylpropanoid Pathway 



Clone Encoded Polypeptide SEQ ID NO : 



Sdp3c.pk002.c22 


PAL (phenylalanine ammonia lyase) 


5 


src3c.pk014.e17 


C4H (cinnamic acid 4-hydroxylase) 


6 


ssm.pk0013.e3* 


CHI (chalcone isomerase) 


7 


src3c.pk009.e4 


CHR (chalcone reductase) 


8 


pOY204* 


lFS (isoflavone synthase) 


9 


sfl1.pk0040.g11* 


F3H (flavanone 3-hydroxylase) 


10 


Sfl1.pk131.g5** 


DFR (dihydroflavonol reductase) 


11 


sre.pk0043.d11** 


DFR (dihydroflavonol reductase) 


12 


ssl.pk0057.d12 


FS (flavonol synthase) 


13 


srr1 c.pk001.k4 


IFR (isoflavone reductase) 


14 



* Some of these clones have been described in other patent applications. For example, clone 
ssm.pk0013.e3 is described in U.S. Patent No. 6,054,636; clone sfl1.pk0040.g11 is described in 
15 PCT publication No. WO 99/43,825, and clone pOY204 is described in PCT publication No. 

WO 00/44,909. 

** Both clones were used together to prepare the probe. 



Probes were prepared by the random primer method using the Random 
20 Primers DNA Labeling System from GIBCO-BRL, Life Technologies according to 
the manufacturer's protocol. The entire plasmid was used as template for all 
probes, except for IFS, where the template was a PCR product containing the IFS 
coding region. This PCR amplification product was obtained as described in 
Example 1, above, for the preparation of WSJ001. 
25 The entire random primer reaction mixture, without purification, was used for 

hybridization. Hybridization conditions were based on a protocol from PerfectHyb 
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Buffer (Sigma-Aldrich, St. Louis, MO). Hybridizations were carried out overnight at 
68°C. The membranes were then washed twice with 2 X SSC buffer (GIBCO BRL, 
Life Technologies) and once with 0.1 X SSC for 15 minutes each at 68°C. 
Immunoblot analyses 

Antibodies to CHS and CHR were prepared by Covance (Richmond, CA) to 
protein purified from E. coli expressing the CHS or CHR coding region using 
standard methods. The IFS antibody was prepared to synthetic peptides of the IFS 
protein as described in WO 00/44,909. Standard protocols were used for 
immunoblot analyses with anti-CHS, anti-CHR, or anti-IFS antisera. The 
Supersignal West Pico chemiluminescent substrate (Pierce, Rockford, IL) was used 
for visualization of the bound antibodies for CHS and CHR, while the Femto 
chemiluminescent substrate (Pierce, Rockford, IL) was used for IFS. 

Table 3 shows the relative detection of the RNA and/or protein of the different 
genes in the isoflavonoid pathway in either seeds from wild type segregant control 
plants or seeds from the CRC recombinant expression construct plants, harvested 
at 150 mg or 250 mg. One plus sign (+) indicates that the RNA or protein is clearly 
detected; +/- indicates that the RNA or protein is barely detected; and more than 
one plus sign indicates the approximate increase in detection of the particular RNA 
or protein levels. 

TABLE 3 

Levels of Expression of Phenylpropanoid Pathway Genes in 





RNA Level 


Protein Level 




150 mg 


250 mg 


150 mg 


250 mg 


Gene 


WT 


CRC 


WT 


CRC 


WT 


CRC 


WT 


CRC 


PAL 


+ 


+++++ 


+ 


+++++ 


nd* 


nd 


nd 


nd 


C4H 


+ 


++++ 


+ 


++++ 


nd 


nd 


nd 


nd 


CHS 


nd 


nd 


nd 


nd 


+ 


+++++ 


+ 




CHI 


+ 


+++ 


+ 


+++ 


nd 


nd 


nd 


nd 


CHR 


+ 


+++ 


+ 


+++ 


+ 


++ 


+ 


++ 


F3H 


+ 


++++ 


+ 


++++ 


nd 


nd 


nd 


nd 


DFR 


+ 


++++ 


+/- 


++++ 


nd 


nd 


nd 


nd 


FS 


+/- 


+++ 


+/- 


+++ 


nd 


nd 


nd 


nd 


IFS 


++ 


++ 


++ 


++ 


+ 


+ 


+ 


+ 


IFR 


+/- 


+/- 


+/- 


+ 


nd 


nd 


nd 


nd 



*not determined 



38 



WO 02/101023 



PCT/US02/21107 



These results indicate that expression of particular soybean genes of the 
phenyipropanoid pathway, as listed above, was increased when the CRC 
recombinant expression construct was expressed in soybean seed. In the upper 
phenyipropanoid pathway the most dramatic changes were observed in expression 
5 of the PAL and CHS genes. Expression of C4H, CHI, and CHR was also increased 
significantly. 

Expression of IFS was not increased. IFR, an enzyme involved in the synthesis 
of glyceollins from daidzein, was not increased in the younger seed and had a slight 
increase in the older seed. 

10 Expression of some genes encoding enzymes involved in the 

flavonol/anthocyanin branch of the phenyipropanoid pathway was increased by 
CRC expression. These include F3H, DFR, and FS. 

It was determined that soybean seed expressing the CRC recombinant 
expression construct present a brown stripe along the median making them easy to 

15 identify. From the analysis of R1 , R2, and R3 seed it was determined that the levels 
of total isoflavones and total daidzein to total genistein ratios vary both in control 
seed and in seed containing the CRC recombinant expression construct. 

Overall the total daidzein to total genistein ratios for seed containing the CRC 
recombinant expression construct ranged between 2.9 and 801.0 and for samples 

20 from control seed ranged between 0.3 and 1.6. There is no overlap in these ranges. 
Of the seed examined, the total isoflavone levels were higher in the R1 seed 
from plants expressing the CRC recombinant expression construct than in plants not 
expressing the CRC recombinant expression construct. With two exceptions the 
total isoflavone levels of R2 seed obtained from field-grown plants were higher in 

25 seed from plants expressing the CRC recombinant expression construct compared 
to seed from plants not expressing the recombinant expression construct. In this 
instance there were two outliers, one seed from the 1-25 transformation event 
containing the CRC recombinant expression construct had lower total isoflavone 
levels than seed from the wt-segregants, and one seed from a wt-segregant of the 

30 1-2 transformation event had unusually high total isoflavone levels. All R3 seed 
examined containing the brown stripe along the median had higher total isoflavone 
levels than seed from wt-segregants. 
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EXAMPLE 7 

Identification of Intermediates of the Phenvlpropanoid Pathway 
that Accumulate in Transformants 
Containing the CRC Recombinant expression construct 
5 Mass spectroscopy was used to determine the differences in HPLC profiles 

between soybean seeds expressing the CRC recombinant DNA fragment and 
control seeds. Using mass spectroscopy three compounds were identified that are 
almost undetectable in wild type seed but present in seed expressing the CRC 
recombinant expression construct. Each of the additionally identified compounds 

10 has an m/z of 505 but differ in retention times of 15.46, 21 .29, and 21 .75 min 

(compare Figure 18, from wild type seed, and Figure 19 from seed expressing the 
CRC recombinant expression construct). MS2 analysis produced one major 
fragment with an m/z of 257 for each of the compounds. This mass indicates the 
loss of a fragment with a mass of 248, which is consistent with fragmentation of a 

15 malonyl-glucose from a conjugated compound. 

Liquiritigenin and isoliquiritigenin, intermediates in daidzein synthesis, both 
have a mass of 256 matching the m/z of 257 detected for each of the unknown 
peaks. The unknowns were further analyzed by first using in-source fragmentation 
(source collision induced dissociation) to remove the 248 m/z fragment leaving the 

20 257 m/z species, followed by MS2. The initial fragmentation was done under 
conditions determined to be ideal for removal of the malonyl-glucose moiety from 
the malonyl-glucose derivatives of daidzein and genistein. MS2 produced the same 
fragments of 239, 147, and 137 for each of the three unknowns. Analysis of 
liquiritigenin and isoliquiritigenin standards showed MS1 spectra with a major peak 

25 of m/z 257 and MS2 fragments of 239, 147, and 137 for each compound. These 
results suggest that the three unknowns are malonyl-glucose derivatives of 
liquiritigenin and/or isoliquiritigenin. 

Further characterization of the liquiritigenin and isoliquiritigenin standards 
showed that the UV spectra and retention times could be used to distinguish the two 

30 compounds. The UV spectrum of liquiritigenin matched that of the unknown with the 
15.5 retention time, while the spectra of the unknowns at 21.3 and 21.8 both are 
similar to the isoliquiritigenin UV spectrum (data not shown). The retention times of 
the unknowns, when compared to the 18.3 and 27.1 retention times of liquiritigenin 
and isoliquiritigenin, respectively, also match expectations based on the differences 

35 between retention times for flavonoid agiycones and their corresponding malonyl- 
glucose conjugates. From this and the above data, it is concluded that the unknown 
at 15.5 is the malonyl-glucose conjugate of liquiritigenin, and the unknowns at 21.3 
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and 21.8 are malonyl-glucose conjugates of isoliquiritigenin. Conjugation of 
isoliquiritigenin at two different positions probably accounts for the latter two peaks. 
Accumulation of these intermediates in the CRC seed suggests that the isoflavone 
synthase catalyzed reaction may be limiting (Figure 1), although increased capture 
5 of intermediates by enhanced activities of genes encoding enzymes involved in 
conjugation is also a possibility. 
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CLAIMS 

What is claimed is: 

1 . A method of altering the isoflavonoid profile of an isoflavonoid-producing 
plant, said method comprising: 

5 (a) transforming a plant with (i) a first recombinant expression construct 

comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a recombinant expression construct comprising a 
promoter operably linked to an isolated nucleic acid fragment encoding an R 
myc-type transcription factor, (ii) a second recombinant expression construct 

10 comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a promoter operably linked to an isolated nucleic 
acid fragment encoding an R myc-type transcription factor, or (iii) a recombinant 
expression construct comprising a promoter operably linked to an isolated nucleic 
acid fragment encoding all or part of a C1 myb transcription factor and all or part of 

15 an R myc-type wherein said construct is capable of functioning as both a C1 myb 
transcription factor and an R myc-type transcription factor; and 

(b) growing the transformed plant under conditions that are suitable for the 
expression of the recombinant expression construct or constructs; 

wherein expression of the construct or constructs alters the isoflavonoid profile 

20 of the transformed plant by increasing the total daidzein to total genistein ratio 
compared to the total daidzein to total genistein ratio of a control. 

2. The method of Claim 1 wherein the plant is transformed with a 
recombinant expression construct comprising a promoter operably linked to an 
isolated nucleic acid fragment encoding a chimeric transcription factor comprising 

25 the maize R coding region situated between the C1 DNA binding domain and the C1 
activation domain. 

3. The method of Claim 1 or Claim 2 wherein the promoter is a seed-specific 
promoter. 

4. The method of Claim 1 or Claim 2 wherein the isoflavonoid-producing plant 
30 is selected from the group consisting of soybean, clover, mung bean, lentil, hairy 

vetch, alfalfa, lupine, sugar beet, and snow pea. 

5. An isoflavonoid-producing plant made by the method of Claim 1 or 2 
wherein said plant has an increased total daidzein to total genistein ratio compared 
to the total daidzein to total genistein ratio of a control. 

35 6. The isoflavonoid-producing plant of Claim 5 wherein said plant is selected 

from the group consisting of soybean, clover, mung bean, lentil, hairy vetch, alfalfa, 
lupine, sugar beet, and snow pea. 
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7. Seeds or plant parts of the plant of Claim 5. 

8. Seeds or plant parts of the plant of Claim 6. 

9. An isoflavonoid-containing product having an increased ratio of total 
daidzein to total genistein obtained from the seeds or plant parts of Claim 7. 

5 10. An isoflavonoid-containing product having an increased ratio of total 

daidzein to total genistein obtained from the seeds or plant parts of Claim 8. 

1 1 . The isoflavonoid-containing product of Claim 9 wherein the isoflavonoid 
product is selected from the group consisting of protein isolate, protein concentrate, 
meal, grits, full fat and defatted flours, textured proteins, textured flours, textured 

10 concentrates, and textured isolates. 

12. The product of Claim 10 wherein the isoflavonoid-containing product is 
selected from the group consisting of protein isolate, protein concentrate, meal, 
grits, full fat and defatted flours, textured proteins, textured flours, textured 
concentrates, and textured isolates. 

15 13. An extracted isoflavonoid-containing product having an increased ratio of 

total daidzein to total genistein wherein said product is extracted from the seeds or 
plant parts of Claim 7. 

14. An extracted isoflavonoid-containing product having an increased ratio of 
total daidzein to total genistein wherein said product is extracted from the seeds or 

20 plant parts of Claim 8. 

15. A food which has incorporated therein the product of Claim 9. 

16. A food which has incorporated therein the product of Claim 10. 

17. A beverage which has incorporated therein the product of Claim 9. 

18. A beverage which has incorporated therein the product of Claim 10. 

25 19. An isoflavonoid-containing soy protein product having an increased ratio of 

total daidzein to total genistein obtained from the seeds of Claim 8 wherein the 
seeds are soybean seeds. 

20. The product of Claim 19 wherein the isoflavonoid product is selected from 
the group consisting of protein isolate, protein concentrate, meal, grits, full fat and 

30 defatted flours, textured proteins, textured flours, textured concentrates, textured 
isolates, soymilk, tofu, fermented soy products, and whole bean soy products. 

21. An extracted isoflavonoid-containing soy protein product having an 
increased ratio of total daidzein to total genistein wherein said product is extracted 
from the seeds of Claim 8 wherein the seeds are soybean seeds. 

35 22. A food which has incorporated therein the product of Claim 19. 

23. A beverage which has incorporated therein the product of Claim 19. 
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24. A method of producing an isoflavonoid-containing product 
which comprises: 

(a) cracking the seeds of Claim 7 to remove the meats from the hulls; and 

(b) flaking the meats obtained in step (a) to obtain the desired flake 
5 thickness. 

25. A method of producing an isoflavonoid-containing product which 
comprises: 

(a) cracking the seeds of Claim 8 remove the meats from the hulls; and 

(b) flaking the meats obtained in step (a) to obtain the desired flake 

10 thickness. 

26. The method of Claim 25 wherein the seeds are soybean seeds. 

27. An isoflavonoid-producing plant comprising in its genome 

(i) a first recombinant expression construct comprising a promoter 
operably linked to an isolated nucleic acid fragment encoding a C1 myb transcription 

15 factor and a recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding an R myc-type transcription 
factor, 

(ii) a second recombinant expression construct comprising a promoter 
operably linked to an isolated nucleic acid fragment encoding a C1 myb transcription 

20 factor and a promoter operably linked to an isolated nucleic acid fragment encoding 
an R myc-type transcription factor, or 

(iii) a recombinant expression construct comprising a promoter 
operably linked to an isolated nucleic acid fragment encoding all or part of a C1 myb 
transcription factor and all or part of an R myc-type wherein said construct is 

25 capable of functioning as both a C1 myb transcription factor and an R myc-type 
transcription factor; 

wherein said plant has an increased total daidzein to total genistein ratio when 
compared to the total daidzein to total genistein ratio of a control. 

28. The isoflavonoid-producing plant of claim 27 wherein the recombinant 
30 expression construct (iii) comprises a promoter operably linked to an isolated 

nucleic acid fragment encoding a chimeric transcription factor comprising the maize 
R coding region situated between the C1 DNA binding domain and the C1 activation 
domain. 

29. The isoflavonoid-producing plant of claim 27 or 28 wherein the promoter is 
35 a seed-specific promoter. 

30. The isoflavonoid-producing plant of Claim 27 or Claim 28 wherein the 
isoflavonoid-producing plant is selected from the group consisting of soybean, 
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clover, mung bean, lentil, hairy vetch, alfalfa, lupine, sugar beet, and snow pea. 

31 . The isoflavonoid-producing plant of Claim 27 or 28 wherein said plant has 
an increased total daidzein to total genistein ratio compared to the total daidzein to 
total genistein ratio of a control. 
5 32. The isoflavonoid-producing plant of Claim 31 wherein said plant is selected 

from the group consisting of soybean, clover, mung bean, lentil, hairy vetch, alfalfa, 
lupine, sugar beet, and snow pea. 

33. Seeds or plant parts of the plant of Claim 31 

34. Seeds or plant parts of the plant of Claim 32. 

10 35. An isoflavonoid-containing product having an increased ratio of total 

daidzein to total genistein obtained from the seeds or plants parts of Claim 33. 

36. An isoflavonoid-containing product having an increased ratio of total 
daidzein to total genistein obtained from the seeds or plant parts of Claim 34. 

37. The product of Claim 35wherein the isoflavonoid product is selected from 
15 the group consisting of protein isolate, protein concentrate, meal, grits, full fat and 

defatted flours, textured proteins, textured flours, textured concentrates, and 
textured isolates. 

38. The product of Claim 36 wherein the isoflavonoid-containing product is 
selected from the group consisting of protein isolate, protein concentrate, meal, 

20 grits, full fat and defatted flours, textured proteins, textured flours, textured 
concentrates, and textured isolates. 

39. An extracted isoflavonoid-containing product having an increased ratio of 
total daidzein to total genistein wherein said product is extracted from the seeds or 
plant parts of Claim 33. 

25 40. An extracted isoflavonoid-containing product having an increased ratio of 

total daidzein to total genistein wherein said product is extracted from the seeds or 
plant parts of Claim 34. 

41 . A food which has incorporated therein the product of Claim 35. 

42. A food which has incorporated therein the product of Claim 36. 

30 43. A beverage which has incorporated therein the product of Claim 35 

44. A beverage which has incorporated therein the product of Claim 36. 

45. An isoflavonoid-containing soy protein product having an increased ratio of 
total daidzein to total genistein obtained from the seeds of Claim 34 wherein the 
seeds are soybean seeds. 

35 46. The product of Claim 44 Wherein the isoflavonoid product is selected from 

the group consisting of protein isolate, protein concentrate, meal, grits, full fat and 
defatted flours, textured proteins, textured flours, textured concentrates, textured 
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isolates, soymilk, tofu, fermented soy products, and whole bean soy products. 

47. An extracted isoflavonoid-containing soy protein product having an 
increased ratio of total daidzein to total genistein wherein said product is extracted 
from the seeds of Claim 34 wherein the seeds are soybean seeds. 
5 48. A food which has incorporated therein the product of Claim 40. 

49. A beverage which has incorporated therein the product of Claim 40. 

50. A method of producing an isoflavonoid-containing product 
which comprises: 

(a) cracking the seeds of Claim 33 to remove the meats from the hulls; 

10 and 

(b) flaking the meats obtained in step (a) to obtain the desired flake 
thickness. 

51. A method of producing an isoflavonoid-containing product which 
comprises: 

15 (a) cracking the seeds of Claim 34 remove the meats from the hulls; and 

(b) flaking the meats obtained in step (a) to obtain the desired flake 
thickness. 

52. The method of Claim 50 wherein the seeds are soybean seeds. 
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SEQUENCE LISTING 

<110> E. I. du Pont de Nemours 

<12 0> A METHOD FOR ALTERING THE ISOFLAVONOID PROFILE IN THE PLANT PARTS OF 
AN ISOFLAVONOID -PRODUCING PLANT 

<130> BB1452 PCT 

<140> 
<141> 

<150> 60/297,981 

<151> 2001-06-13 

<160> 16 

<170> Microsoft Office 97 

<210> 1 
<211> 22 
<212> DNA 

<213> Artif icial/Oligonucleotide primer 
<400> 1 

aggcggaaga actgctgcaa eg 22 



<210> 2 
211> 22 
<212> DNA 

<213> Artif icial/Oligonucleotide primer 
<400> 2 

aggtccattt egtegcagag gc 22 



<210> 3 
<211> 24 
<212> DNA 

<213> Artif icial/PCR primer 
<400> 3 

atgtttggca agtaggaagg gacc 24 



<210> 4 
<211> 24 
<212> DNA 

<213> Artif icial/PCR primer 
<400> 4 

gcattccata agccgtcacg attc 24 



<210> 5 

<211> 1919 

<212> DNA 

<213> Glycine max 
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<400> 5 

gcacgagacc ccacacagca accagagcag ctatgctagt gagaatcaac acactcctcc 60 

aaggctactc aggaatcagg tttgaaattt tggaggcaat cacaaagctt ctgaacaaca 12 0 

acattacccc atgtttgcca cttaggggaa caatcacagc atctggtgat cttgttcctt 180 

tgtcctacat tgctggfcttg ctaactggta gaccaaactc caaggctgtt ggaccctctg 240 

gtgagattct gaatgccaaa gaagcctttg aattggccaa catcggtgct gagttctttg 300 

agttgcaacc taaggaaggc cttgcccttg tgaatggcac tgctgttggt tctggcttgg 360 

cctcaattgt tctatttgaa gccaacatca ttgctgtctt gtctgaagtt atttcagcaa 42 0 

tttttgctga agttatgcaa ggaaagcctg aattcactga ccatttgact cataaactaa 480 

agcaccaccc tggtcagatt gaagctgctg ctattatgga acacattttg gaaggaagct 540 

cttacgtgaa agctgctaag aagttgcatg agattgatcc tttacaaaag cctaaacagg 600 

accgttatgc tcttaggact tcaccacaat ggcttggtcc tctaattgaa gtgattagat 660 

tctctaccaa gtcaattgag agggagatta actcagtcaa tgacaaccct ttgattgatg 72 0 

tgtcaaggaa caaggcactt catggtggta acttccaagg aactcctatt ggagtctcca 78 0 

tggataatac acgtttggct cttgcttcaa ttggtaaact catgtttgct caattctctg 84 0 

agcttgtcaa tgattattac aacaatggtt tgccttcaaa tctcactgcc agcagaaacc 900 

ccagcttgga ttatggattc aagggagctg aaattgccat ggcatcttat tgttctgaac 960 

ttcaatattt ggcgaatccg gtgacaagcc acgtgcaaag cgcggagcaa cacaaccaag 102 0 

atgtgaactc tctggggctg atttcatcaa ggaagactca tgaggctatt gagatcctca 1080 

agctcatgtc ctccactttc ctggtcgccc tttgccaagc cattgacttg aggcatttgg 1140 

aggagaattt gaagaacacg gtcaagaacg ttgtgagtca agttgctaag aggactctca 12 0 0 

ccacaggtgt caatggagag cttcaccctt caaggttttg tgagaaggac ttgctcaagg 1260 

ttgttgatag ggagtacaca tttgcataca ttgatgaccc ctgcagtgga acataccctt 132 0 

tgatgcaaaa gctaaggcaa gtgcttgtgg actatgcatt ggccaatgga gagaacgaga 13 8 0 

agaacacaaa cacatcaatc ttccaaaaga ttgcaacatt tgaggaagag ttgaagaccc 144 0 

ttttgcctaa ggaagtggaa ggtgcaagag ttgcatatga gaatgaccaa tgtgcaattc 150 0 

caaacaagat caaggaatgc aggtcttacc ccttgtacaa gtttgtgaga gaggagttgg 1560 

ggacagcatt gctaactggt gaaagggtta tctcaccggg tgaagagtgt gacaaagtgt 162 0 

tcactgcttt gtgccaaggg aagatcattg atccactttt ggaatgcctt ggggagtgga 1680 

atggggcacc tcttccaata tgtfcagtttt tcttattttc tgttttcttg aagagtggtt 1740 

tcttttctgt acacgtgttt gtgttgatat taagcatttg gtttgtctat ataaggctgt 1800 

ggcaaatcaa tccacataca acaacttccc agttttcctt gatgtatgcc atgcaaggaa 1860 

cttgtaattc ataatgtaat agaattccat ttgtttgccg taaaaaaaaa aaaaaaaaa 1919 

<210> 6 

<211> 1812 

<212> DNA 

<213> Glycine max 

<400> 6 

ccaaactgaa ccaaatcatc acacacacat actaagaagc aacaattctt ctacaatgga 60 

tctcctcctt ctggaaaaga ccctcatagg tctcttcctc gctgcggtgg tcgccatcgc 12 0 

cgtctccacc ctccgcggcc ggaaattcaa gctcccaccg ggcccactcc ccgtcccaat 18 0 

cttcggcaac tggctccaag tcggcgacga cctcaaccac cgcaacctca ccgatttggc 24 0 

caaaaaattc ggtgacatct tcctcctccg catggggcag cgcaacctcg tcgtggtttc 30 0 

ttcccctgag ctcgccaaag aggttctcca cacgcagggc gtggagttcg gctcccgcac 360 

ccgcaacgtc gtcttcgaca tcttcaccgg aaagggccaa gacatggtct tcaccgtcta 42 0 

cggcgagcac tggcgcaaaa tgcgccgcat catgaccgtc cccttcttca ccaacaaggt 48 0 

tgtgcaacaa taccgccatg gatgggaatc ggaggctgcc gccgtcgtcg aggacgtcaa 54 0 

gaaaaacccc gacgccgccg tctccggcac cgtcatccgc cgccgccttc agctcatgat 600 

gtacaacaac atgtaccgca taatgttcga ccggaggttc gagagcgagg aggatcccat 660 

cttccagagg ctaagagcct tgaacggaga gaggagtcgc ttggcgcaga gctttgagta 72 0 

taactatggt gattttattc ccatcttgag acccttcttg aagggttact tgaagatttg 78 0 

caaggaggtg aaggagacga ggttgaagct tttcaaggat tacttcgttg acgagaggaa 84 0 

gaagcttgga agcaccaaga gcaccaacaa caataatgaa cttaaatgcg ctattgacca 90 0 

cattttggat gcccagagaa aaggcgagat caacgaagac aacgtcctct acattgttga 960 

aaacatcaac gttgctgcaa ttgaaacaac tctatggtcg attgagtggg gcattgctga 102 0 

gcttgtgaac cacccagaga tccagcaaaa gttaagggat gagattgaca gagttcttgg 1080 

agcagggcac caagtgactg agccagacat ccaaaagctc ccatacctcc aagcagtggt 114 0 
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caaggaaact cttcgtctta gaatggcaat ccctctcctt gtaccacaca tgaacctcca 1200 

cgacgcaaag cttgggggct atgatatccc agctgagagc aagatcttgg tgaatgcatg 1260 

gtggctggcc aacaaccctg cacactggaa gaagccagag gagttccggc ctgagaggtt 1320 

cttcgaggag gagtcgcttg ttgaagccaa tggcaatgac tttaggtacc ttccctttgg 13 80 

tgttggcaga agaagctgcc ctggaatcat tcttgcattg ccaattcttg gcatcacttt 1440 

gggacgtttg gtccaaaact ttgagctctt gcctccccct ggccagtcac agattgacac 1500 

tagtgagaaa ggagggcaat ttagcttgca catactcaag cattccacca ttgtggcaaa 1560 

gccaaggtca ttttagactc caccacatga tcatgatcat atgatcaatc cccttaattt 162 0 

tttctttttt cttttccctt ttctttactc tgtattgtat caatgcttga aaatggggtt 1680 

gttccaaaaa tgtcatatat atttggcctg ctaatgggta ttgtaaatct ctgaacttga 174 0 

agtgatgcac ggttttgaga hggttttcta ataaacctac acttttgtct ctaaaaaaaa 1800 

aaaaaaaaaa aa 1812 

<210> 7 

<211> 828 

<212> DNA 

<213> Glycine max 

<400> 7 

gttaaataga aaagaggagt ttgagaatgg caacgatcag cgcggttcag gtggagttcc 60 

tggagtttcc agcggtggtt acttcaccag cctccggcaa gacctatttc ctcggcggcg 120 

caggggagag aggattgacg attgagggga agttcataaa gttcacaggc ataggagtat 18 0 

acttggagga taaggcggtg ccatcactcg ccgctaagtg gaagggtaaa acttcagagg 240 

agttagttca caccctccac ttctacaggg atatcatttc agggccgttt gaaaagctaa 300 

ttagagggtc gaagattctg ccattggctg gcgctgaata ctcaaagaag gtgatggaaa 3 60 

actgcgtggc acacatgaag tctgttggga cttacggtga tgctgaagcc gcagccattg 420 

aaaagtttgc tgaagccttc aagaatgtga attttgcacc tggtgcctct gttttctaca 480 

gacaatcacc tgatggaatc ttggggctta gtttctctga agatgcaaca ataccagaaa 540 

aggaggctgc agtgatagag aacaaggctg tatcagcggc ggtcttggag accatgattg 600 

gtgaacatgc tgtttcccct gacttaaaac gcagtttggc ttctcgattg cctgcggtat 660 

tgagccacgg cattatagtc tgagaaatga gaaggatcaa ctttaccttt ttcaaatatt 72 0 

cttgtttttc tcctttcttt cttgtcgctt gtcatgtatt tctactgttt tattaaataa 780 

taaaattgag ttctgttaga gttggtgaaa aaaaaaaaaa aaaaaaaa 828 

<210> 8 

<211> 1394 

<212> DNA 

<213> Glycine max 

<400> 8 

gttagaatgg ctgctgctat tgaaatcccc acaatagtgt ttccaaactc ctctgcccaa 60 

cagaggatgc cagtggttgg aatgggatct gcccctgact tcacatgcaa gaaagacaca 12 0 

aaggaggcta tcattgaggc cgtgaaacag ggttacagac acttcgacac tgctgctgct 180 

tatggctctg aacaggctct cggtgaagct ctcaaggaag ctatccatct tggcctcgtc 240 

tcccgccaag acctctttgt cacttccaag ctttgggtca ccgaaaatca tcctcatctt 3 00 

gtccttcctg ctttgcgcaa atcacttaaa actcttcaac tagagtactt ggacctgtat 360 

ctcatccact ggcccctgag ttctcagcca gggaagttct catttccaat tgaagtagaa 42 0 

gatctcttgc cttttgacgt gaagggtgtg tgggaatcca tggaagagtg ccagaaactt 480 

ggcctcacca aagccattgg agtcagcaac ttctctgtca agaagcttca gaatctgcfcc 54 0 

tctgttgcta ccatccgtcc cgtggtcgat caagtggaga tgaaccttgc atggcaacag 600 

aagaagctaa gagagttctg caaagaaaat gggataatag tgactgcgtt ctctccactg 660 

aggaaaggtg cgagcagggg cccaaatgaa gtgatggaga atgatgtgct gaaggagatt 72 0 

gcagaggctc atgggaaatc catagcccag gtgagtctga gatggttgta cgaacaaggt 78 0 

gtgacatttg tgccaaagag ctacgataag gagaggatga accagaatct gcacatcttt 84 0 

gactgggcat tgactgaaca agatcatcac aaaataagtc aaatcagcca gagccgtttg 90 0 

atcagcggac ccaccaaacc acaactcgct gatctctggg atgatcaaat ataaactatt 960 

tactactatg cagctcccac tctattttta taatccatct ttttacctct tgttcatttg 102 0 

tgccaaagag ctacgataag gagaggatga accagaatct gcacatcttt gactgggcat 1080 

tgactgaaca agatcatccc aaaataagtc aaatcagcca gagccgtttg atcagcggac 114 0 

cccccaaacc acaactcgct gatctctggg atgatcaaat ataaactatt tactactatg 12 0 0 
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cagctcccac 


tctattttta 


taatccatct 


ttttacctct 


tgtttcattt 


tacgtttaaa 


1260 


taattcatgc 


catgccactt 


cttattttag 


atttcacaat 


caataaacta 


ggcacgcgcg 


1320 


gcacatgata 


tgaataaact 


atgttcaatt 


tttttttcaa 


aaaaaaaaaa 


aaaaaaaaaa 


1380 


aaaaaaaaaa 


aaaa 












<210> 9 














<211> 1756 














<212> DNA 














<213> Glycine max 












<400> 9 














gtaattaacc 


tcactcaaac 


tcgggatcac 


agaaaccaac 


aacagttctt 


gcactgaggt 


SO 


ttcacgatgt 


tgctggaact 


tgcacttggt 


ttgtttgtgt 


tagctttgtt 


tctgcacttg 


120 


cgtcccacac 


caagtgcaaa 


atcaaaagca 


cttcgccacc 


tcccaaaccc 


tccaagccca 


180 


aagcctcgfcc 


ttcccttcat 


tggccacctt 


cacctcttaa 


aagataaact 


tctccactat 


240 


gcactcatcg 


atctctccaa 


aaagcatggc 


cccttattct 


ctctctcctt 


cggctccatg 


300 


ccaaccgtcg 


ttgcctccac 


ccctgagttg 


ttcaagctct 


tcctccaaac 


ccacgaggca 


360 


acttccttca 


acacaaggtt 


ccaaacctct 


gccataagac 


gcctcactta 


cgacaactct 


420 


gtggccatgg 


ttccattcgg 


accttactgg 


aagttcgtga 


ggaagctcat 


catgaacgac 


480 


cttctcaacg 


ccaccaccgt 


caacaagctc 


aggcctttga 


ggacccaaca 


gatccgcaag 


540 


ttccttaggg 


ttatggccca 


aagcgcagag 


gcccagaagc 


cccttgacgt 


caccgaggag 


600 


cttctcaaat 


ggaccaacag 


caccatctcc 


atgatgatgc 


tcggcgaggc 


tgaggagatc 


660 


agagacatcg 


ctcgcgaggt 


tcttaagatc 


ttcggcgaat 


acagcctcac 


tgacttcatc 


720 


tggcctttga 


agtatctcaa 


ggttggaaag 


tatgagaaga 


ggattgatga 


catcttgaac 


780 


aagttcgacc 


ctgtcgttga 


aagggtcatc 


aagaagcgcc 


gtgagatcgt 


cagaaggaga 


840 


aagaacggag 


aagttgttga 


gggcgaggcc 


agcggcgtct 


tcctcgacac 


tttgcttgaa 


900 


ttcgctgagg 


acgagaccat 


ggagatcaaa 


attaccaagg 


agcaaatcaa 


gggccttgtt 


960 


gtcgactttt 


tctctgcagg 


gacagattcc 


acagcggtgg 


caacagagtg 


ggcattggca 


1020 


gagctcatca 


acaatcccag 


ggtgttgcaa 


aaggctcgtg 


aggaggtcta 


cagtgt tgtg 


1080 


ggcaaagata 


gactcgttga 


cgaagttgac 


actcaaaacc 


ttccttacat 


tagggccatt 


1140 


gtgaaggaga 


cattccgaat 


gcacccacca 


ctcccagtgg 


tcaaaagaaa 


gtgcacagaa 


1200 


gagtgtgaga 


ttaatgggta 


tgtgatccca 


gagggagcat 


tggttctttt 


caatgtttgg 


1260 


caagtaggaa 


gggaccccaa 


atactgggac 


agaccatcag 


aattccgtcc 


cgagaggttc 


1320 


ttagaaactg 


gtgctgaagg 


ggaagcaggg 


cctcttgatc 


ttaggggcca 


gcatttccaa 


1380 


ctcctcccat 


ttgggtctgg 


gaggagaatg 


tgccctggtg 


tcaatttggc 


tacttcagga 


1440 


atggcaacac 


ttcttgcatc 


tcttatccaa 


tgctttgacc 


tgcaagtgct 


gggccctcaa 




ggacaaatat 


tgaaaggtga 


tgatgccaaa 


gttagcatgg 


aagagagagc 


tggcctcaca 


1560 


gttccaaggg 


cacatagtct 


cgtttgtgtt 


ccacttgcaa 


ggatcggcgt 


tgcatctaaa 


1620 


ctcctttctt 


aattaagata 


atcatcatat 


acaatagtag 


tgtcttgcca 


tcgcagttgc 


1680 


tttttatgta 


ttcataatca 


tcatttcaat 


aaggtgtgac 


tggtacttaa 


tcaagtaatt 


1740 


aaggttacat 


acatgc 










1756 


<210> 10 














<211> 1465 














< 2 1 2 > DNA 














<213> Glyc: 


Lne max 












<400> 10 














gcacgagggc 


acgaggaagc 


attgcattct 


gctatttaat 


tccactacgt 


acacgcacat 


60 


tctcctcaaa 


gacaacaatg 


gcaccaacag 


ccaagactct 


gacttacctg 


gcccaggaga 


120 


aaaccctaga 


atcgagcttc 


gttcgggacg 


aggaggagcg 


tcccaaggtt 


gcctacaacg 


180 


aattcagcga 


cgagatccca 


gtgatttctc 


ttgccggaat 


cgacgaggtg 


gatggacgca 


240 


gaagagagat 


ttgtgagaag 


atcgtggagg 


cttgcgagaa 


ttggggtata 


ttccaggttg 


300 


ttgatcacgg 


tgtggatcaa 


caactcgtgg 


ccgagatgac 


ccgtctcgcc 


aaagagttct 


360 


ttgctttgcc 


accggacgag 


aagcttcgtt 


ttgatatgtc 


cggcgccaaa 


aagggtggat 


420 


tcattgtctc 


cagccatctc 


caaggggaat 


cggtgcagga 


ctggagagaa 


atagtgacat 


480 


acttttcgta 


cccaaaaaga 


gagagggact 


attcaaggtg 


gccagacacg 


ccagaagggt 


540 


ggagatcggt 


gactgaggaa 


tacagcgaca 


aagtaatggg 


tctagcttgc 


aagctcatgg 


600 


aggtgttgtc 


cgaagcaatg 


gggttagaga 


aagagggttt 


aagcaaagca 


tgtgttgaca 


660 


tggaccagaa 


ggtggtggtt 


aattactacc 


ccaaatgccc 


tcaacctgac 


ctcactcttg 


720 
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gcctgaagcg 


ccacacggat 


ccgggcacta 


tcaccttgct 


gcttcaggac 


caagtgggtg 


780 


gacttcaagc 


caccagggac 


aatggcaaaa 


catggatcac 


cgttcagcct 


gtggaggctg 


840 


ccttcgtcgt 


caatcttgga 


gatcatgctc 


attatctgag 


caatggaagg 


ttcaagaatg 


900 


ctgatcacca 


agcggtggtg 


aactcaaacc 


atagccgttt 


gtccatagcc 


acttttcaaa 


960 


acccagcacc 


aaatgcaact 


gtttaccctc 


tgaagataag 


agaaggagag 


aagcctgtga 


1020 


tggaggaacc 


aatcactttt 


gctgaaatgt 


acaggaggaa 


gatgagcaag 


gacattgaga 


1080 


ttgcaaggat 


gaagaagctg 


gctaaggaaa 


agcatttgca 


ggaccttgag 


aatgaaaagc 


1140 


atttgcaaga 


acttgatcag 


aaggcaaaac 


ttgaggccaa 


gcctttgaag 


gagattcttg 




cttaattaat 


aataattaca 


tatgtatcat 


ttgcatgccc 


ccttggtgtt 


tttagtattt 


1260 


tthaagggcc 


atgaattaat 


aatagtcctt 


acctttgtgc 


ttttgtacgt 


cttatgattt 


1320 


atcctttgtg 


gggatatcat 


gtgttgtgtt 


cagttgccta 


tgtcttatta 


gctagctggc 


1380 


tcatctatgt 


ataccttata 


tgtgcctcta 


ttataaatga 


aaataagtgg 


cactgtcttt 


1440 


atfcaaaaaaa 


aaaaaaaaaa 












<210> 11 














<211> 1279 














<212> DNA 














<213> Glyc: 


Lne max 












<400> 11 














ataattcaac 


tgttttgggg 


tgtgattaaa 


agagaagcta 


gctaaaaaaa 


atgggttcag 


60 


catccgaaag 


tgtttgcgtt 


acaggagctt 


ctggtttcat 


cgggtcatgg 


cttgtcatga 


120 


gactcatcga 


gcgtggctac 


accgttcgag 


ccaccgtacg 


cgacccagta 


aacatgaaga 


180 


aggtgaagca 


tttggtggaa 


ctaccaggcg 


caaagagcaa 


actgtctctg 


tggaaggctg 


240 


atcttgctga 


agagggaagc 


tttgatgaag 


ccattaaagg 


ctgcaccgga 


gttttccacg 


300 


tggccacccc 


catggacttt 


gaatccaaag 


accctgagaa 


tgaagtgata 


aagcctacaa 


360 


taaatggggt 


actagacatc 


atgaaagcat 


gcttgaaggc 


aaaaactgtg 


cgaaggctaa 


420 


tattcacgtc 


ctcagccgga 


accctcaacg 


ttattgagcg 


ccaaaagccc 


gttttcgacg 


480 


acacatgctg 


gagtgacgtt 


gagttttgcc 


gtagagttaa 


gatgactggt 


tggatgtatt 


540 


ttgtttctaa 


aacactggcg 


gagaaagaag 


catggaaatt 


tgccaaagag 


cagggcctgg 


600 


acttcatcac 


tatcattcca 


cctcttgttg 


tcggtccctt 


tctgatgcca 


accatgccac 


660 


ctagcctaat 


cacggctcta 


tcgccaatca 


caggaaatga 


ggaccattac 


tcgatcataa 


720 


agcaaggtca 


attcgtccac 


ttagatgatc 


tctgtcttgc 


tcacatattt 


ctgtttgagg 


780 


aaccagaagt 


ggaagggagg 


tacatatgca 


gtgcatgtga 


cgctaccatt 


catgacattg 


840 


ccaaattaat 


taaccaaaaa 


taccctgagt 


acaaggtccc 


caccaagttc 


aagaatattc 


900 


cagatcaatt 


ggagcttgtg 


agattttctt 


ccaagaagat 


cacagacttg 


ggattcaaat 


960 


ttaaatacag 


cttagaggac 


atgtacactg 


gagcaattga 


cacatgcaga 


gacaaagggc 




ttcttccgaa 


acctgcagaa 


aaagggcttt 


ttactaaacc 


tggagaaact 


ccagtgaatg 


1080 


ccatgcataa 


ataggcattc 


atatctttgt 


atctgtgtga 


tggctgtgca 


acttgctttt 


1140 


cttattccgt 


tgagtggctt 


ttcttgatta 


acgtttctgt 


tttatgaaaa 


attagaaatg 


1200 


tgagtggctt 


gtaaggccag 


gttatcttca 


ataagttaat 


aaaaaccatc 


ttctaaagtc 


1260 


taaaaaaaaa 


aaaaaaaaa 










1279 


<210> 12 














<211> 1234 














<212> DNA 














<213> Glycine max 












<400> 12 














gcacgagatt 


ttatttttct 


ttctttcttt 


ggaagataaa 


gaatgggttc 


taagtccgaa 


60 


accgtttgcg 


ttactggggc 


ttctggttac 


atcggatcat 


ggcttgtcat 


gagactcatc 


120 


gagcgtggct 


ataccgttcg 


agccaccgta 


ctcgacccag 


ctgatatgag 


ggaggtgaag 


180 


catttgctgg 


atctgccagg 


tgcagagagc 


aagctgtctc 


tgtggaaggc 


agaacttaca 


240 


gaagagggaa 


gctttgatga 


agccattaaa 


gggtgcacag 


gtgttttcca 


cttggccacc 


300 


cccgttgact 


ttaagtccaa 


agacccagag 


aatgaaatga 


taaagcctac 


aattcaagga 


3 60 


gtactaaaca 


tcatgaaagc 


atgcctgaag 


gcaaaaactg 


tccgaaggct 


agtattcacg 


420 


tcctcagccg 


gaactaccaa 


cattactgag 


caccaaaagc 


ctatcattga 


cgaaacctgc 


480 


tggactgatg 


ttgagttctg 


ccggagatta 


aatatgactg 


gttggatgta 


tttcgtttct 


540 


aaaacacttg 


cggagaaaga 


agcttggaaa 


tttgcgaaag 


agcacggcat 


ggacttcatc 


600 


gctatccttc 


cagctcttgt 


cattggtccc 


tttctactgc 


caacaatgcc 


ttctagcgtg 


660 
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atcagtgctc tttcacctat taacggaatt 

caattcgtcc acatagaaga tatctgtctt 

gcagaaggga ggtatatatg cagtgcatgt 

ahtaacgaaa aatacccaga gtacaaggtt 

ttggagcccg tgagattttc ttccaagaaa 

agcttagagg atatgtacac tggagcaatt 

aaacctgcag aaattccagc gaatggcatc 

thgcctcggt gatggcfcatg aatgttgctt 

gttttgtgaa ttcgtagtca aaattgtaag 

aattgtttaa tattatcaca aaaaaaaaaa 

<210> 13 

<211> 1345 

<212> DMA 

<213> Glycine max 

<400> 13 

gcacgagaat caacacacac aaacacaaca 
agcttccaaa tccaaagatg ctgccatccc 
accaggcatc acaaccgttc aaggggtgaa 
cccagatgaa gggaaagtgg tgcatgagat 
ccaaattgtg aaccatgaca tacctagtga 
aatgttcttt gagttgccac aagaggaaaa 
ttctattgaa gggtatggca caaagcttca 
ggatcatttg ttccacattg tgtggcctcc 
aaacccccct tcttacaggg aagttaatga 
agacaaattg ttcaaaagta tgtcggtagg 
gggtgcaaat gaagatgaca tgcattatct 
atgtcctgat ctggtcttgg gtgtgccacc 
ggtgcctaac gaggtgcagg gccttcaagc 
gtatgtcccc aatgccctcg ttattcacat 
aaaatataag gcagtttttc acagaacaac 
gcccgtgttc atagaaccca aaaaggaaca 
ccaagacaat ccaccaaaat acaaaaccaa 
caataagatc cctcaatgaa tgaagtgggc 
ctgttttgtt atttaattta tggaatgaaa 
atctgtgact aatgttggtt tctctgtttt 
ttgtgggaag tagattaaat aagatctgtg 
agtaaataaa gatgacaaga aacagtttgt 
aaaaaaaaaa aaaaaaaaaa aaaaa 

<210> 14 

<211> 911 

<212> DNA 

<213> Glycine max 

<400> 14 

agaccaagtc aagatcgttg cagcaataaa 
atctgaattt gggctggatg tggaccgtca 
cgaggaaaaa gcgaaaattc gaagagtaat 
atgttgtcat gcctttactg gttatttctt 
tcctcctagg gacaaggtgt tcatacaagg 
tgaggctgat gtgggaactt ttaccatcga 
agccgtgcac ataagactcc caaacaatta 
ggagaaaaaa attgggaaaa ctcttgagaa 
gcaaattaag gagacttctt tcctaaataa 
gataaaggga gatgcagtgt atgagattga 
ttatcctcac gtggaataca gcactgtttc 
gaaatcttag gaaccaaagc acttatatga 
caaatgcagt tcttccttct gtttttcttt 



gaggcacatt 


attcaatcat 


aaagcaagct 


720 


gctcacatat 


ttctgtttga 


acagccaaaa 


780 


gacgttacta 


tccatgacat 


tgtaaaatta 


840 


cccaccaagt 


ttcagaacat 


tccagatcaa 


900 


atcacagact 


tgggattcca 


atttaaatac 


960 


gatacatgca 


tagagaaagg 


gcttcttcct 


1020 


gagcataaat 


aaatataggt 


tttcatatct 


1080 


ttcttgttca 


gtttctttaa 


tgatgtttcc 


1140 


tggtttgtaa 


gaccaaatta 


gttatctaca 


1200 


aaaa 






1234 


acatatggag 


gtgctaaggg 


tgcaaaccat 


60 


agccatgttt 


gttagggcag 


agacagagca 


120 


ccttgaggtg 


ccaattattg 


attttagtga 


180 


tttggaggca 


agtagggact 


ggggcatgtt 


240 


tgttataaga 


aagttgcaaa 


gtgttgggaa 


300 


agagttgatt 


gctaagcctg 


ctgggtctga 


360 


gaaagaggtg 


aatggcaaga 


aagggtgggt 


420 


ttcctccatc 


aactacagtt 


tctggcccca 


480 


ggaatattgc 


aagcacctaa 


gaggagtggt 


540 


gtfcggggctt 


gaagagaatg 


agctaaagga 


600 


tttaaaaatc 


aattattacc 


caccttgtcc 


660 


acacacagac 


atgtcctacc 


tcacaattct 


720 


atgtagggat 


ggccattggt 


acgatgttaa 


780 


tggcgaccaa 


atggagatac 


tgagcaatgg 


840 


agtgaacaaa 


gatgagacaa 


gaatgtcgtg 


900 


agaagttggt 


cctcacccaa 


agttggttaa 


960 


gaaatacaag 


gattatgctt 


attgtaagct 


1020 


actacgagat 


aattatagct 


catggtttct 


1080 


gtgtgacttg 


tgggaagtag 


attaaataag 


1140 


gttatttaat 


ttatggaatg 


aaagtgtgac 


1200 


actaatgttg 


gattaatctt 


atgcttttga 


1260 


ctgttctgtt 


aaaaaaaaaa 


aaaaaaaaaa 


1320 








1345 


agaagctgga 


aacgtcaaga 


gatttttccc 


60 


cgatgcggct 


gagcctgtaa 


gagaagtttt 


120 


tgaagctgaa 


ggaattcctt 


acacttacct 


180 


acgtaacctg 


gcacagattg 


acatcactgt 


240 


agatggaaat 


gtgaaaggag 


cgtatattac 


300 


agcagcaaat 


gaccctagag 


ccttgaacaa 


360 


tttgtcctta 


aatgatatca 


tctctttgtg 


420 


aatttatgtt 


tcagaagaag 


aagttcttaa 


480 


ttatcttctg 


gcactatacc 


actcacagca 


540 


ccctgccaaa 


gaccttgagg 


cttctgaggc 


600 


tgaatatttg 


gatcagtttg 


tctgatttcc 


660 


ttttatcact 


ctgcaaaatg 


cttaataaaa 


720 


cagaaaaggc 


ctatgcaggc 


ttttttgctg 


780 
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cactaccatt gtctgtttcg tagtggttgc ttgtgtttgt cctgttctca agaaagtatc 84 0 

cacatttaat cagtttaggt aagtgttcta cctgaaaaaa taataatata atcctgtttt 90 0 

aatattgtta c 911 

<210> 15 
<211> 21 
<212> DNA 

<213> Artif icial/PCR primer 
<400> 15 

ttgctggaac ttgcacttgg t 21 



<210> 16 
<211> 32 
<212> DNA 

<213> Artif icial/PCR primer 
<400> 16 

gtatatgatg ggtaccttaa ttaagaaagg ag 



32 
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TITLE 

A METHOD FOR ALTERING THE ISOFLAVONOID PROFILE 
IN THE PLANT PARTS OF AN ISOFLAVONOID-PRODUCING PLANT 
This application claims priority to U.S. Provisional Application No. 60/297,981, 
5 filed June 13, 2001 incorporated herein by reference in its entirety. 

This invention pertains to methods of altering the ratios of individual 
isoflavonoids in isoflavonoid-producing plants by using a C1 myb transcription factor 
and an R-type myc transcription factor that regulate expression of genes in the 
phenylpropanoid pathway. 

10 Isoflavonoids represent a class of secondary metabolites produced in legumes 

by a branch of the phenylpropanoid pathway and include such compounds as 
isoflavones, isoflavanones, rotenoids, pterocarpans, isoflavans, quinone derivatives, 
3-aryl-4-hydroxycoumarins, 3-arylcoumarins, isoflav-3-enes, coumestans, 
alpha-methyldeoxybenzoins, 2-arylbenzofurans, isoflavanol, coumaronochromone 

15 and the like. In plants, these compounds are known to be involved in interactions 
with other organisms and to participate in the defense responses of legumes against 
phytopathogenic microorganisms (Dewick, P. M. (1993) in The Flavonoids, 
Advances in Research Since 1986, Harborne, J. B. Ed., pp. 117-238, Chapman and 
Hall, London). Isoflavonoid-derived compounds also are involved in symbiotic 

20 relationships between roots and rhizobial bacteria which eventually result in 
nodulation and nitrogen-fixation (Phillips, D. A. (1992) in Recent Advances in 
Phytochemistry. Vol. 26, pp 201-231, Stafford, H. A. and Ibrahim, R. K., Eds, 
Plenum Press, New York), and overall they have been shown to act as antibiotics, 
repellents, attractants, and signal compounds (Barz, W. and Welle, R. (1992) 

25 Phenolic Metabolism in Plants, pp 139-164, Ed by H. A. Stafford and R. K. Ibrahim, 
Plenum Press, New York). 

Isoflavonoids have also been reported to have physiological activity in animal 
and human studies. For example, it has been reported that the isoflavones found in 
soybean seeds possess antihemolytic (Nairn, M., et al. (1976) J. Agric. Food Chem. 

30 24:1 174-1 177), antifungal (Nairn, M., et al. (1974) J. Agr. Food Chem. 22:806-810), 
estrogenic (Price, K. R. and Fenwick, G. R. (1985) Food Addit. Contam. 2:73-106), 
tumor-suppressing (Messina, M. and Barnes, S. (1991) J. Natl. Cancer Inst. 
83:541-546; Peterson, G., et al. (1991) Biochem. Biophys. Res. Commun. 
779:661-667), hypolipidemic (Mathur, K., et al. (1964) J. Nutr. 84:201-204), and 

35 serum cholesterol-lowering (Sharma, R. D. (1979) Lipids 74:535-540) effects. 
These studies indicate that isoflavones in soybean protein products may produce 
many significant health benefits. 
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Free isoflavones rarely accumulate to high levels in soybeans. Instead they 
are usually conjugated to carbohydrates or organic acids. Soybean seeds contain 
three types of isoflavones in three different forms: the aglycones, daidzein, genistein 
and glycitein; the glucosides, daidzin, genistin and glycitin; and the 
5 malonylglucosides, 6"-0-malonyldaidzin, 6"-0-malonylgenistin and 6"-0- 
malonylglycitin. During processing acetylglucoside forms are produced: 6'-0- 
acetyldaidzin, 6'-0-acetyl genistin, and 6'-0-acetyl glycitin. The content of 
isoflavonoids in soybean seeds is quite variable and is affected by both genetics 
and environmental conditions such as growing location and temperature during 

10 seed fill (Tsukamoto, C, et al. (1995) J. Agric. Food Chem. 43:1184-1192; Wang, H. 
and Murphy, P. A. (1994) J. Agric. Food Chem. 42:1674-1677). In addition, 
isoflavonoid content in legumes can be stress-induced by pathogen attack, 
wounding, high UV light exposure and pollution (Dixon, R. A. and Paiva, N. L. 
(1995) Plant Cell 7:1085-1097). The genistein isoflavonoid forms make up the most 

15 abundant group in soybean seed and most food products, while daidzein and 
glycitein forms are present in lower levels (Murphy, P.A. (1999) J. Agric. Food 
Chem. 47:2697-2704). 

The biosynthetic pathway for isoflavonoids in soybean and their relationship 
with several other classes of phenylpropanoids is presented in Figure 1 A and 

20 Figure 1B. 

Though the branch initiated by isoflavone synthase that leads to synthesis of 
isoflavonoids is mainly limited to the legumes, the phenylpropanoid pathway and 
other branches occur in other plant species. In maize, genes of the 
phenylpropanoid pathway and the lower anthocyanin branch are regulated by the 
25 transcription factor C1 in combination with an R-type factor. Together C1 and an R- 
type factor activate expression of a set of genes that leads to the synthesis and 
accumulation of anthocyanins in maize cells (Grotewold, E., et al. (1998) Plant Cell 
10:121-140). 

Maize C1 is a myb-type transcription factor that regulates expression of genes 
30 involved in anthocyanin production and accumulation in maize cells. However C1 
cannot activate gene expression alone, and requires interaction with an R-type myc 
transcription factor for activation of target gene promoters. The R-type factors 
include, among others, alleles of R, alleles of the homologous B gene of maize, and 
alleles of the homologous Lc gene. These genes function similarly and make up the 
35 R/B gene family (Goff, S.A., et al. (1992) Genes Dev. 6:864-875). The various 
genes of the R/B gene family may be in turn each found as diverging alleles that 
fluctuate in expression pattern within the corn plant due to differences in their 
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promoters. The members of this family encode proteins with very similar amino acid 
sequences and thus have comparable effects on the anthocyanin pathway structural 
genes. The specificity of the different promoters provides tissue specificity of 
anthocyanin biosynthesis (Radicella, J. P. et al. (1992) Genes Dev. 6:2152-2164; 
5 Walker, E.L. (1995) EMBO J. 74:2350-2363). The skilled artisan will recognize that 
the coding region of any functional gene of this large family could be used in 
conjunction with a promoter of choice to obtain R-gene function in the desired tissue 
or developmental stage. Examples of R/B family genes and alleles include, but are 
not limited to, Lc, R, R-S, R-P, Sn, B-Peru, and B-l. The coding regions of particular 

10 alleles of the Lc or B genes, especially the B-Peru allele, have been most commonly 
used in experiments in conjunction with C1. 

Cell suspension lines of the maize inbred Black Mexican Sweet (BMS) that 
harbored an estradiol-inducible version of a fusion of C1 and R (CRC) were 
analyzed after the addition of estradiol. The cDNA fragments from the known 

15 flavonoid genes, except chalcone isomerase, were induced in the CRC-expressing 
line after hormone induction (Bruce et al. (2000) Plant Cell 12:65-80). Maize C1 and 
an R-type factor together can promote the synthesis of anthocyanins in Arabidopsis 
tissues that do not naturally express anthocyanins (Lloyd, A.M., et al. (1992) 
Science 258:1773-1775), and in petunia leaves (Quattrocchio, F., et al. (1993) Plant 

20 Cell 5:1497-1512). 

WO 99/37794, published July 29, 1999, discloses the expression of maize C1 
and the Lc allele of R in tomato fruit which led to increased levels of the flavonol 
kaempferol. Thus, it is known that C1 and an R-type factor can regulate expression 
of individual genes of the phenylpropanoid pathway in plants including Arabidopsis, 

25 petunia, tomato, and maize leading to production of anthocyanins or flavonols. 
These are all plants that do not produce isoflavones. Isoflavone production is 
almost exclusively limited to the legumes. An example of one of the few non- 
legume plants that does produce isoflavones is sugar beet. 

C1 and B-Peru were transiently expressed in white clover and pea, which are 

30 legumes, (Majnik, et al. (1 998) Aust. J. Plant Phys. 25:335-343) and anthocyanin 
levels assayed by visual inspection. Transient expression of C1 and B-Peru did 
result in production of anthocyanin in several tissues of white clover and pea. No 
assay was performed to determine any effect of C1 and B-Peru on isoflavonoid 
levels. Thus, any effects of C1 and an R-type myc on isoflavonoid levels in 

35 isoflavonoid-producing plants has not been taught. 

WO 00/44909, published August 3, 2000, discloses transformation of 
soybeans with maize C1 and R (as a CRC chimera) in conjunction with 
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overexpression of the isoflavone synthase gene. Any effects of CRC alone on 
levels of isoflavonoids have not been reported. Thus, it is not known whether 
introduction of C1 and an R-type factor alone, without isoflavone synthase, could 
have any effect on the synthesis and accumulation of isoflavonoids in 
5 isoflavonoid-producing plants. 

The physiological benefits associated with isoflavonoids in both plants and 
humans make the manipulation of their contents in crop plants highly desirable. For 
example, increasing levels of isoflavonoids in soybean seeds would increase the 
efficiency of extraction and lower the cost of isoflavone-related products sold today 

10 for use in either reduction of serum cholesterol or in estrogen replacement therapy. 
In addition to altering the levels of total isoflavonoids, altering the ratios of 
individual isoflavonoid components is of interest. There is some indication that 
genistein and daidzein have individual effects in plant disease response and on 
human health. While daidzein is the precursor to the major phytoalexins of 

15 soybean, the glyceollins, genistein is involved in establishing the cell response to 
pathogen attack so that glyceollins are synthesized (Graham and Graham (2000) 
Mol. Plant Microbe Interact. 5:181-219). In human health, daidzein is effective in 
reducing levels of LDL-cholesterol and increasing the levels of HDL-cholesterol in 
human blood (US Patent No. 5,855,892). Daidzein is also effective for the treatment 

20 of hypertension and coronary atherosclerotic heart disease (Liu, Y., et al. (1990) 
Shenyang Yaoxueyuan Xuebao 7:123-125). Thus, raising the daidzein component 
in the total isoflavonoids could be valuable. 

Therefore there is a need to enhance the level of isoflavonoids and to alter the 
ratios of isoflavonoid components in isoflavonoid-producing plants. 

25 SUMMARY OF THE INVENTION 

This invention concerns a method of altering the isoflavonoid profile of an 
isoflavonoid-producing plant, said method comprising: 

(a) transforming a plant with (i) a first recombinant expression construct 
comprising a promoter operably linked to an isolated nucleic acid fragment encoding 

30 a C1 myb transcription factor and a second recombinant expression construct 

comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
an R myc-type transcription factor, (ii) a recombinant expression construct 
comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a promoter operably linked to an isolated nucleic 

35 acid fragment encoding an R myc-type transcription factor, or (iii) a recombinant 
expression construct comprising a promoter operably linked to an isolated nucleic 
acid fragment encoding all or part of a C1 myb transcription factor and all or part of 
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an R myc-type transcription factor wherein said construct is capable of functioning 
as both a C1 myb transcription factor and an R myc-type transcription factor; and 
(b) growing the transformed plant under conditions that are suitable for the 
expression of the recombinant expression construct or constructs; wherein 
5 expression of the construct or constructs alters the isoflavonoid profile of the 

transformed plant by increasing the total daidzein to total genistein ratio compared 
to the total daidzein to total genistein ratio of a control. 

In a second embodiment, the recombinant expression construct described 
above comprises a promoter operably linked to an isolated nucleic acid fragment 
10 encoding a chimeric transcription factor comprising a maize R myc-type coding 
region situated between the C1 DNA binding domain and the C1 activation domain. 

In a third embodiment, the isoflavonoid-producing plant is selected from the 
group consisting of soybean, clover, mung bean, lentil, hairy vetch, alfalfa, lupine, 
sugar beet, and snow pea. Also of interest are seed or plant parts of a plant 
15 transformed with a recombinant expression construct of the invention from which 
isoflavonoid-containing products can be obtained or extracted. 

In a fourth embodiment, this invention concerns a food or beverage 
incorporating these isoflavonoid-containing products. 

In a fifth embodiment, this invention concerns a method of producing an 
20 isoflavonoid-containing product which comprises: (a) cracking the seeds obtained 
from plants transformed with any of the recombinant expression constructs of the 
invention to remove the meats from the hulls; and (b) flaking the meats obtained in 
step (a) to obtain the desired flake thickness. 

In a sixth embodiment, this invention concerns an isoflavonoid-producing plant 
25 comprising in its genome 

(i) a first recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding a C1 myb transcription factor 
and a second recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding an R myc-type transcription 

30 factor, 

(ii) a recombinant expression construct comprising a promoter operably linked 
to an isolated nucleic acid fragment encoding a C1 myb transcription factor and a 
promoter operably linked to an isolated nucleic acid fragment encoding an R myc- 
type transcription factor, or 

35 (iii) a recombinant expression construct comprising a promoter operably linked 

to an isolated nucleic acid fragment encoding all or part of a C1 myb transcription 
factor and all or part of an R myc-type wherein said construct is capable of 
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functioning as both a C1 myb transcription factor and an R myc-type transcription 
factor; 

wherein said plant has an increased total daidzein to total genistein ratio when 
compared to the total daidzein to total genistein ratio of a control. 

5 

BIOLOGICAL DEPOSIT 
The following plasmid has been deposited under the terms of the Budapest 
Treaty with the American Type Culture Collection (ATCC), 10801 University 
Boulevard, Manassas, VA 2011 0-2209, and bears the following designation, 
10 accession number and date of deposit. 

Plasmid Accession Number Date of Deposit 

PDP7951 PTA371 07/29/1999 

BRIEF DESCRIPTION OF THE 
DRAWINGS AND SEQUENCE LISTINGS 
15 The invention can be more fully understood from the following detailed 

description and the accompanying Sequence Listing which form a part of this 
application. 

Figure 1A and Figure 1B depict the soybean biosynthetic pathway for 
isoflavonoids and their relationship with several other classes of phenylpropanoids. 

20 Figure 1 A shows the pathway from phenylalanine to daidzein, genistein, and 
dihydroflavonol. Figure 1B shows the pathway from daidzein, genistein, and 
dihydroflavonol to glyceollins, kievitone, anthocyanins, and flavonols. 

Figure 2 depicts the total daidzein to total genistein ratios observed for 
individual R1 seeds from plants obtained from four independent transformation 

25 events showing novel total daidzein to total genistein ratios and from control seeds. 
The source of the seed for each group (i.e. CRC transformation event number or 
control) is indicated above the bars. Control seeds are obtained either from a plant 
which was subject to bombardment and not found to contain the nucleic acid 
fragment of interest or from plants transformed with a recombinant DNA expression 

30 construct that does not alter the isoflavonoid profile of the transformed plant. Seeds 
1,2, 5,6,7, 8,9, 10, 11, 14, 15, 16, 17,21,22, 23,24, 25, 26,27, 28,29,31,35, 
36, 37, 39, 40, and 43 are from plants resulting from transformation experiments 
that, during PCR amplification, were negative for the CRC recombinant expression 
construct Seeds numbered 1 through 7 in Figure 2 of the provisional application 

35 correspond to those numbered 3, 20, 8, 41 , 1 3, 30, and 38 in this figure. 

6 
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Figure 3 depicts the total of isoflavone levels for individual R1 seeds obtained 
from plants from four independent transformation events showing novel total 
daidzein to total genistein ratios and from control seeds. The seeds in this figure 
are the same as those in Figure 2. The source of the seeds for each group (i.e. 
5 control or event number) is indicated above the bars. 

Figure 4 depicts the total daidzein to total genistein ratios observed for single 
R2 seed from field-grown transgenic plants derived from CRC recombinant 
expression construct lines and from wild type segregants (indicated by an asterisk 
[*] on the figure). The seed with novel total daidzein to total genistein ratios also 
10 showed a brown stripe along the median. The source (CRC transformation event 
number) of the seed for each group is indicated above the bars. 

Figure 5 depicts the total daidzein to total genistein ratios observed for single 
R2 seed from a plant derived from the 1-1 transformation event and grown in a 
growth room. Seed without a brown stripe along the median are indicated with a 
15 pound sign (#) above the bars while unmarked bars represent seed with a brown 
stripe along the median. 

Figure 6 depicts the total isoflavone levels observed for single R2 seed from 
field-grown transgenic plants derived from CRC recombinant expression construct 
lines and from wild type segregants (indicated by an asterisk [*] on the figure). The 
20 source of the seed for each group (CRC transformation event number) is indicated 
above the bars. 

Figure 7 depicts the total isoflavone levels in single R2 seed obtained from a 
plant grown in a growth room and derived from the 1-1 transformation event. Seed 
wirhout a brown stripe along the median are indicated with a pound sign (#) above 

25 the bars while the other, unmarked bars represent seed with a brown stripe. These 
represent the same individual seeds as in Figure 5. 

Figure 8 depicts the total daidzein to total genistein ratios of bulk-analyzed R3 
seed harvested from plants grown in a growth room. Each bulk seed sample is from 
a separate plant. The CRC recombinant expression construct line (i.e. CRC 

30 transformation event number) for each seed sample is indicated above the bars. 
Seed samples from wild type segregants derived from the CRC recombinant 
expression construct lines are indicated by an asterisk [*] above the bars. 

Figure 9 depicts the total isoflavone levels of bulk-analyzed R3 seed harvested 
from plants grown in a growth room. Each bulk seed sample is from a separate 

35 plant. The CRC recombinant expression construct line (i.e. CRC transformation 
event number) for each seed sample is indicated above the bars. Seed samples 
from wild type segregants derived from the CRC recombinant expression construct 
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lines are indicated by an asterisk [*] on the figure. The seed are the same as those 
analyzed for Figure 8. 

Figure 10 depicts the totals of individual isoflavones (daidzein, glycitein, and 
genistein) as well as the total isoflavones obtained from HPLC analyses of extracts 
5 prepared from individual R1 seeds obtained from plants transformed with the CRC 
recombinant DNA expression construct. Three to five seeds were analyzed from 
each plant. The control seeds are obtained from a transformant negative for the 
CRC recombinant DNA expression construct. Seeds obtained from plants positive 
for the CRC recombinant DNA expression construct are from individual 

10 transformation events 1-1, 1-2, 1-25, and 1-35. The source of the seeds for each 
group (transformation event followed by plant number) is indicated above the bars. 

Figure 1 1 depicts the ratios of total daidzein to the total isoflavones obtained 
for the same R1 seeds transformed with the CRC recombinant DNA expression 
construct and analyzed in Figure 10. 

15 Figure 12 depicts the ratios of total genistein to the total isoflavones obtained 

for the same R1 seeds transformed with the CRC recombinant DNA expression 
construct and analyzed in Figure 10. 

Figure 13 depicts the ratios of total daidzein to the total isoflavones obtained 
for individual R2 seeds from plants grown in the growth room and derived from three 

20 different transformation events (1-1 , 1-2, and 1-25). The ratios obtained for six 
seeds from each plant are shown. The individual plants from which the seeds were 
harvested are identified with a number letter combination above the bars. The first 
two numbers designate the transformation event number, the third number 
designates the R0 plant, and the letter designates the R1 plant from which the R2 

25 seed were obtained. Seeds not having a brown stripe along the median are 
indicated with a pound sign (#) above the bar. 

Figure 14 depicts the ratios of total genistein to the sum of all isoflavones 
obtained for individual R2 seeds from plants grown in the growth room and derived 
from three different transformation events (1-1 , 1-2, and 1-25). The ratios shown 

30 are for the same seed as shown in Figure 13. 

Figure 15 depicts the ratios of total daidzein to the total isoflavones obtained 
for individual R2 seeds from plants grown in the field and derived from three 
different transformation events (1-1 , 1-2, and 1-25). Each set of 2 seeds labeled 
with an asterisk (*) above the bar are tan seed from a segregant producing only tan 

35 seed, thereby identified as a wt segregant, of the transformation event of the 

adjacent plants. Three seeds all having a brown stripe were assayed from each of 2 
CRC recombinant DNA expression construct -containing plants from each of the 3 
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transformation events. The individual CRC recombinant DNA expression construct- 
containing plants from which the seed were harvested are identified with three 
numbers. The first two numbers designate the transformation event number and the 
third number designates the R0 plant from which the R2 seed were obtained. 
5 Figure 1 6 depicts the ratios of total genistein to the sum of all isoflavones 

obtained for individual R2 seeds from plants grown in the field and derived from 
three different transformation events (1-1, 1-2, and 1-25). The ratios shown are for 
the same seeds as shown in Figure 15. 

Figure 17 depicts the total of isoflavones obtained for individual R2 seeds from 

10 plants grown in the field and derived from three different transformation events (1-1 , 
1-2, and 1-25). The seeds are the same as those analyzed for Figure 1 1 . 

Figure 18 depicts an LC-MS2 mass chromatogram of m/z 504.6 to 505.6 
obtained from extracts from a control wild type segregant seed without the CRC 
recombinant DNA expression construct. 

15 Figure 19 depicts an LC-MS2 mass chromatogram of m/z 504.6 to 505.6 

obtained from extracts from brown striped R3 seed derived from the 1-1 
transformation event. Additional peaks at 14.38, 15.46, 21.29, and 21.75 minutes 
are seen. 

The following sequence descriptions and Sequences Listing attached hereto 
20 comply with the rules governing nucleotide and/or amino acid sequence disclosures 
in patent applications as set forth in 37 C.F.R. §1.821-1.825. The Sequence Listing 
contains the one letter code for nucleotide sequence characters and the three letter 
codes for amino acids as defined in conformity with the IUPAC-IUB standards 
described in Nucleic Acids Research 13:3021 -3030 (1985) and in the Biochemical 
25 Journal 219 (No. 2j:345-373 (1984) which are herein incorporated by reference. 
The symbols and format used for nucleotide and amino acid sequence data comply 
with the rules set forth in 37 C.F.R. §1.822. 

SEQ ID NO:1 is the nucleotide sequence of primer 1 used for detection of the 
CRC recombinant DNA fragment. 
30 SEQ ID NO:2 is the nucleotide sequence of primer 2 used for detection of the 

CRC recombinant DNA fragment. 

SEQ ID NO:3 is the nucleotide sequence of primer 3 used for the detection pf 
genomic and chimeric isoflavone synthase genes. 

SEQ ID NO:4 is the nucleotide sequence of primer 4 used for the detection of 
35 genomic and chimeric isoflavone synthase genes. 
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SEQ ID NO:5 is the nucleotide sequence of the cDNA insert in clone 
sdp3c.pk002.c22 encoding at least a portion of a soybean phenylalanine ammonia 
lyase. 

SEQ ID NO:6 is the nucleotide sequence of the cDNA insert in clone 
5 src3c.pk014.e1 7 encoding at least a portion of a soybean cinnamic acid 4- 
hydroxylase. 

SEQ ID NO:7 is the nucleotide sequence of the cDNA insert in clone 
ssm.pk0013.e3 encoding at least a portion of a soybean chalcone isomerase. 

SEQ ID NO:8 is the nucleotide sequence of the cDNA insert in clone 
10 src3c.pk009.e4 encoding at least a portion of a soybean chalcone reductase. 

SEQ ID NO:9 is the nucleotide sequence of the cDNA insert in clone pOY204 
encoding at least a portion of a soybean isoflavone synthase. 

SEQ ID NO:10 is the nucleotide sequence of the cDNA insert in clone 
sfh.pk0040.g1 1 encoding at least a portion of a soybean flavanone 3-hydroxylase 
15 SEQ ID NO:1 1 is the nucleotide sequence of the cDNA insert in clone 

sfl1.pk131.g5 encoding a portion of a soybean dihydroflavonol reductase. 

SEQ ID NO: 12 is the nucleotide sequence of the cDNA insert in clone 
src.pk0043.d1 1 encoding at least a portion of a soybean dihydroflavonol reductase. 

SEQ ID NO:13 is the nucleotide sequence of the cDNA insert in clone 
20 ssl.pk0057.d1 2 encoding at least a portion of a soybean flavonol synthase. 

SEQ ID NO:14 is the nucleotide sequence of the cDNA insert in clone 
srr1c.pk001 .k4 encoding at least a portion of a soybean isoflavone reductase. 

SEQ ID NO:15 is the nucleotide sequence of primer5 used for the preparation 
of an isoflavone synthase sequence by amplification from clone pOY204. 
25 SEQ ID NO: 16 is the nucleotide sequence of primer6 used for the preparation 

of an isoflavone synthase sequence by amplification from clone pOY204. 

DETAILED DESCRIPTION OF THE INVENTION 
All patents, patent applications and publications cited are incorporated herein 
30 by reference in their entirety. 

In the context of this disclosure, a number of terms shall be utilized. 
The term "isoflavonoid(s)"refers to a large group of polyphenolic compounds, 
based on a common diphenylpropane skeleton, which occur naturally in plants. 
This term, as used herein, includes, but is not limited to, the three types of 
35 isoflavones in three different forms: the aglycones, daidzein, genistein and glycitein; 
the glucosides, daidzin, genistin and glycitin; and the malonylglucosides, 6"-0- 
malonyldaidzin, 6"-0-malonylgenistin and 6"-0-malonylgIycitin, as well as, the 
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acetylglucoside forms: 6'-0-acetyldaidzin, 6'-0-acetyI genistin, and 6'-0-acetyl 
glycitin that are formed during processing. 

As used herein, "total genistein" refers to the total amount of this isoflavonoid 
regardless of the form. Thus, "total genistein" includes the aglycone form, the 
5 glucoside form, the malonylglucoside form, and other genistein forms. Likewise, 
"total daidzein" refers to the total amount of this isoflavonoid regardless of the form. 
Thus, "total daidzein" includes the aglycone form, the glucoside form, the 
malonylglucoside form, and other daidzein forms, and "total glycitein" includes the 
aglycone form, the glucoside form, the malonylglucoside form, and other glycitein 
10 forms. 

The term "isoflavonoid-producing plant" refers to a plant in which isoflavonoids 
normally occur. 

The term "control" refers to a plant or plant parts, such as seed, which is/are 
used as the basis for comparison. The control plant or plant parts, such as seed, 

15 described herein are plants or plant parts in which the isoflavone profile has not 
been altered. Examples of suitable controls include, but are not limited to, a wild- 
type plant or plant parts obtained from a wild type plant; a plant which was subject to 
bombardment and not found to contain the nucleic acid fragment or fragments of 
interest or a plant part, such as a seed or seeds, obtained from such a transformed 

20 plant; a control plant or plant part can be one derived from a transformed plant that 
contains the nucleic acid fragment or fragments of interest, but it does not now 
contain the nucleic acid fragment or fragments of interest due to segregation of the 
fragments(s) during sexual reproduction (this can be referred to as a wild-type 
segregant); or a control plant can be a plant transformed with a nucleic acid 

25 fragment that does not alter the isoflavone profile, e.g., a plant transformed to 

produce seeds with a high lysine phenotype but the isoflavone profile would not be 
altered. For example, if the plant of interest is a soybean plant then the preferred 
control would be seeds obtained from one of the plants described above. If the 
plant of interest is clover, then the preferred control would be leaves obtained from 

30 one of the plants described above. Those skilled in the art will appreciate that a 
particular control will depend upon the plant of interest. 

The term "C1 myb transcription factor" refers to a protein encoded by a maize 
C1 gene and to any protein which is functionally equivalent to a C1 myb 
transcription factor. 

35 The term "R myc-type transcription factor refers to a protein with a basic helix- 

loop-helix domain encoded by a member of the R/B gene family and to any protein 
that is functionally equivalent to an R myc-type transcription factor. 
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As used herein, an "isolated nucleic acid fragment" is a polymer of RNA or 
DNA that is single- or double-stranded, optionally containing synthetic, non-natural 
or altered nucleotide bases. An isolated nucleic acid fragment in the form of a 
polymer of DNA may be comprised of one or more segments of cDNA, genomic 
5 DNA or synthetic DNA. Nucleotides (usually found in their ^-monophosphate form) 
are referred to by their single letter designation as follows: "A" for adenylate or 
deoxyadenylate (for RNA or DNA, respectively), "C" for cytidylate or deoxycytidylate, 
"G" for guanylate or deoxyguanylate, "U" for uridylate, "T" for deoxythymidylate, "R" 
for purines (A or G), "Y" for pyrimidines (C or T), "K" for G or T, "H" for A or C or T, 

10 "I" for inosine, and "N" for any nucleotide. 

The terms "subfragment that is functionally equivalent" and "functionally 
equivalent subfragment" are used interchangeably herein. These terms refer to a 
portion or subsequence of an isolated nucleic acid fragment in which the ability to 
alter gene expression or produce a certain phenotype is retained whether or not the 

15 fragment or subfragment encodes an active enzyme. For example, the fragment or 
subfragment can be used in the design of recombinant DNA fragments or chimeric 
genes to produce the desired phenotype in a transformed plant. 

The terms "homology", "homologous", "substantially similar" and 
"corresponding substantially" are used interchangeably herein. They refer to nucleic 

20 acid fragments wherein changes in one or more nucleotide bases do not affect the 
ability of the nucleic acid fragment to mediate gene expression or produce a certain 
phenotype. These terms also refer to modifications of the nucleic acid fragments of 
the instant invention such as deletion or insertion of one or more nucleotides that do 
not substantially alter the functional properties of the resulting nucleic acid fragment 

25 relative to the initial, unmodified fragment. It is therefore understood, as those 
skilled in the art will appreciate, that the invention encompasses more than the 
specific exemplary sequences. 

Moreover, the skilled artisan recognizes that substantially similar nucleic acid 
sequences encompassed by this invention are also defined by their ability to 

30 hybridize, under moderately stringent conditions (for example, 0.5 X SSC, 0.1 % 
SDS, 60°C) with the sequences exemplified herein, or to any portion of the 
nucleotide sequences disclosed herein and which are functionally equivalent to any 
of the nucleic acid sequences disclosed herein. Stringency conditions can be 
adjusted to screen for moderately similar fragments, such as homologous 

35 sequences from distantly related organisms, to highly similar fragments, such as 
genes that duplicate functional enzymes from closely related organisms. Post- 
hybridization washes determine stringency conditions. One set of preferred 
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conditions involves a series of washes starting with 6X SSC, 0.5% SDS at room 
temperature for 15 min, then repeated with 2X SSC, 0.5% SDS at 45°C for 30 min, 
and then repeated twice with 0.2X SSC, 0.5% SDS at 50°C for 30 min. A more 
preferred set of stringent conditions involves the use of higher temperatures in 
5 which the washes are identical to those above except for the temperature of the final 
two 30 min washes in 0.2X SSC, 0.5% SDS was increased to 60°C. Another 
preferred set of highly stringent conditions involves the use of two final washes in 
0.1X SSC, 0.1% SDS at 65°C. 

Sequence alignments and percent similarity calculations may be determined 

10 using a variety of comparison methods designed to detect homologous sequences 
including, but not limited to, the Megalign program of the LASARGENE 
bioinformatics computing suite (DNASTAR Inc., Madison, Wl). Multiple alignment of 
the sequences are performed using the Clustal method of alignment (Higgins and 
Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, 

15 GAP LENGTH PENALTY=10). Default parameters for pairwise alignments and 
calculation of percent identity of protein sequences using the Clustal method are 
KTUPLE=1 , GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. For 
nucleic acids these parameters are KTUPLE=2, GAP PENALTY=5, WINDOW=4 
and DIAGONALS SAVED=4. 

20 "Gene" refers to a nucleic acid fragment that expresses a specific protein, 

including regulatory sequences preceding (5' non-coding sequences) and following 
(3' non-coding sequences) the coding sequence. "Native gene" refers to a gene as 
found in nature with its own regulatory sequences. "Chimeric gene" refers any gene 
that is not a native gene, comprising regulatory and coding sequences that are not 

25 found together in nature. Accordingly, a chimeric gene may comprise regulatory 
sequences and coding sequences that are derived from different sources, or 
regulatory sequences and coding sequences derived from the same source, but 
arranged in a manner different than that found in nature. A "foreign gene" refers to 
a gene not normally found in the host organism, but that is introduced into the host 

30 organism by gene transfer. Foreign genes can comprise native genes inserted into 
a non-native organism, or chimeric genes. A "transgene" is a gene that has been 
introduced into the genome by a transformation procedure. An "allele" is one of 
several alternative forms of a gene occupying a given locus on a chromosome. 
When all the alleles present at a given locus on a chromosome are the same that 

35 plant is homozygous at that locus. If the alleles present at a given locus on a 
chromosome differ that plant is heterozygous at that locus. 
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"Coding sequence" refers to a DNA sequence that codes for a specific amino 
acid sequence. "Regulatory sequences" refer to nucleotide sequences located 
upstream (5' non-coding sequences), within, or downstream (3' non-coding 
sequences) of a coding sequence, and which influence the transcription, RNA 
5 processing or stability, or translation of the associated coding sequence. Regulatory 
sequences may include, but are not limited to, promoters, translation leader 
sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a DNA sequence capable of controlling the expression of 
a coding sequence or functional RNA. The promoter sequence consists of proximal 

10 and more distal upstream elements, the latter elements often referred to as 

enhancers. Accordingly, an "enhancer" is a DNA sequence which can stimulate 
promoter activity and may be an innate element of the promoter or a heterologous 
element inserted to enhance the level or tissue-specificity of a promoter. Promoters 
may be derived in their entirety from a native gene, or be composed of different 

15 elements derived from different promoters found in nature, or even comprise 

synthetic DNA segments. It is understood by those skilled in the art that different 
promoters may direct the expression of a gene in different tissues or cell types, or at 
different stages of development, or in response to different environmental 
conditions. Promoters which cause a gene to be expressed in most cell types at 

20 most times are commonly referred to as "constitutive promoters". New promoters of 
various types useful in plant cells are constantly being discovered; numerous 
examples may be found in the compilation by Okamuro, J. K., and Goldberg, R. B. 
(1989) Biochemistry of Plants 75:1-82. It is further recognized that since in most 
cases the exact boundaries of regulatory sequences have not been completely 

25 defined, DNA fragments of some variation may have identical promoter activity. 

The "translation leader sequence" refers to a polynucleotide sequence located 
between the promoter sequence of a gene and the coding sequence. The 
translation leader sequence is present in the fully processed mRNA upstream of the 
translation start sequence. The translation leader sequence may affect processing 

30 of the primary transcript to mRNA, mRNA stability or translation efficiency. 

Examples of translation leader sequences have been described (Turner, R. and 
Foster, G. D. (1995) Mol. Biotech. 3:225-236). 

The "3" non-coding sequences" refer to DNA sequences located downstream 
of a coding sequence and include polyadenylation recognition sequences and other 

35 sequences encoding regulatory signals capable of affecting mRNA processing or 
gene expression. The polyadenylation signal is usually characterized by affecting 
the addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. The 
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use of different 3' non-coding sequences is exemplified by Ingelbrecht, I. L, et al. 
(1989) Plant Cell 7:671-680. 

"RNA transcript" refers to the product resulting from RNA polymerase- 
5 catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect 
complementary copy of the DNA sequence, it is referred to as the primary transcript 
or it may be an RNA sequence derived from post-transcriptional processing of the 
primary transcript and is referred to as the mature RNA. "Messenger RNA (mRNA)" 
refers to the RNA that is without introns and that can be translated into protein by 

10 the cell. "cDNA" refers to a DNA that is complementary to and synthesized from a 
mRNA template using the enzyme reverse transcriptase. The cDNA can be single- 
stranded or converted into the double-stranded form using the Klenow fragment of 
DNA polymerase I. "Sense" RNA refers to RNA transcript that includes the mRNA 
and can be translated into protein within a cell or in vitro. "Antisense RNA" refers to 

15 an RNA transcript that is complementary to all or part of a target primary transcript 
or mRNA and that blocks the expression of a target gene (U.S. Patent 
No. 5,107,065). The complementarity of an antisense RNA may be with any part of 
the specific gene transcript, i.e., at the 5' non-coding sequence, 3' non-coding 
sequence, introns, or the coding sequence. "Functional RNA" refers to antisense 

20 RNA, ribozyme RNA, or other RNA that may not be translated but yet has an effect 
on cellular processes. The terms "complement" and "reverse complement" are used 
interchangeably herein with respect to mRNA transcripts, and are meant to define 
the antisense RNA of the message. 

The term "operably linked" refers to the association of nucleic acid sequences 

25 on a single nucleic acid fragment so that the function of one is regulated by the 

other. For example, a promoter is operably linked with a coding sequence when it is 
capable of regulating the expression of that coding sequence (i.e., that the coding 
sequence is under the transcriptional control of the promoter). Coding sequences 
can be operably linked to regulatory sequences in a sense or antisense orientation. 

30 In another example, the complementary RNA regions of the invention can be 

operably linked, either directly or indirectly, 5' to the target mRNA, or 3' to the target 
mRNA, or within the target mRNA, or a first complementary region is 5' and its 
complement is 3' to the target mRNA. 

The term "expression", as used herein, refers to the production of a functional 

35 end-product, e.g., an mRNA or a protein (precursor or mature). 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one 
from which any pre- or propeptides present in the primary translation product have 
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been removed. "Precursor" protein refers to the primary product of translation of 
mRNA; i.e., with pre- and propeptides still present. Pre- and propeptides may be 
but are not limited to intracellular localization signals. 

"Stable transformation" refers to the transfer of a nucleic acid fragment into a 
5 genome of a host organism, including both nuclear and organellar genomes, 
resulting in genetically stable inheritance. In contrast, "transient transformation" 
refers to the transfer of a nucleic acid fragment into the nucleus, or DNA-containing 
organelle, of a host organism resulting in gene expression without integration or 
stable inheritance. Host organisms containing the transformed nucleic acid 

10 fragments are referred to as "transgenic" organisms. The preferred method of cell 
transformation of rice, corn and other monocots is the use of particle-accelerated or 
"gene gun" transformation technology (Klein et al., (1987) Nature (London) 
327:70-73; U.S. Patent No. 4,945,050), or an >4grabacre/7t/m-mediated method 
using an appropriate Ti plasmid containing the transgene (Ishida Y. et al., 1996, 

15 Nature Biotech. 74:745-750). 

Standard recombinant DNA and molecular cloning techniques used herein are 
well known in the art and are described more fully in Sambrook, J., Fritsch, E.F. and 
Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor 
Laboratory Press: Cold Spring Harbor, 1989 (hereinafter "Sambrook"). 

20 The term "recombinant" refers to an artificial combination of two otherwise 

separated segments of sequence, e.g., by chemical synthesis or by the 
manipulation of isolated segments of nucleic acids by genetic engineering 
techniques. 

"PCR" or "Polymerase Chain Reaction" is a technique for the synthesis of large 
25 quantities of specific DNA segments, consists of a series of repetitive cycles (Perkin 
Elmer Cetus Instruments, Norwalk, CT). Typically, the double stranded DNA is heat 
denatured, the two primers complementary to the 3' boundaries of the target 
segment are annealed at low temperature and then extended at an intermediate 
temperature. One set of these three consecutive steps is referred to as a cycle. 
30 A "recombinant DNA fragment" refers to an artificial combination of nucleic 

acid fragments that are not found together in nature, e.g. coding sequences and 
non-regulatory sequences. Thus, the difference between a "recombinant DNA 
fragment" and a "recombinant construct" as defined below turns on the presence or 
absence of regulatory sequences in the artificial combination of nucleic acid 
35 sequences. If a regulatory sequence is part of the combination then it is a 

"recombinant construct". If there are no regulatory sequences in the combination, 
then it is a "recombinant DNA fragment". 
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The terms "recombinant construct", "expression construct", "chimeric 
construct", "construct" and "recombinant expression construct" are used 
interchangeably herein. A recombinant construct comprises an artificial combination 
of nucleic acid fragments, e.g., regulatory and coding sequences that are not found 
5 together in nature. For example, a chimeric construct may comprise at least one 
regulatory sequence and at least one coding sequence that are derived from 
different sources, or regulatory sequences and coding sequences derived from the 
same source, but arranged in a manner different than that found in nature. Such 
construct may be used by itself or may be used in conjunction with a vector. If a 

10 vector is used then the choice of vector is dependent upon the method that will be 
used to transform host cells as is well known to those skilled in the art. For 
example, a plasmid vector can be used. The skilled artisan is well aware of the 
genetic elements that must be present on the vector in order to successfully 
transform, select and propagate host ceils comprising any of the isolated nucleic 

15 acid fragments of the invention. The skilled artisan will also recognize that different 
independent transformation events will result in different levels and patterns of 
expression (Jones et al., (1985) EMBO J. 4:2.41 1-2418; De Almeida et al., (1989) 
Mol. Gen. Genetics 218:78-86), and thus that multiple events must be screened in 
order to obtain lines displaying the desired expression level and pattern. Such 

20 screening may be accomplished by Southern analysis of DNA, Northern analysis of 
mRNA expression, immunoblotting analysis of protein expression, or phenotypic 
analysis, among others. 

The present invention concerns a method of altering the isoflavonoid profile of 
an isoflavonoid-producing plant, said method comprising: 

25 (a) transforming a plant with (i) a first recombinant expression construct 

comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a second recombinant expression construct 
comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
an R myc-type transcription factor, (ii) a recombinant expression construct 

30 comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a promoter operably linked to an isolated nucleic 
acid fragment encoding an R myc-type transcription factor, or (iii) a recombinant 
expression construct comprising a promoter operably linked to an isolated nucleic 
acid fragment encoding all or part of a C1 myb transcription factor and all or part of 

35 an R myc-type transcription factor wherein said construct is capable of functioning 
as both a C1 myb transcription factor and an R myc-type transcription factor; and 
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(b) growing the transformed plant under conditions that are suitable for the 
expression of the recombinant expression construct or constructs; wherein 
expression of the construct or constructs alters the isoflavonoid profile of the 
transformed plant by increasing the total daidzein to total genistein ratio compared 
5 to the total daidzein to total genistein ratio of a control. 

Also of interest are isoflavonoid-producing plants comprising in their genome 
(i) a first recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding a C1 myb transcription factor 
and a second recombinant expression construct comprising a promoter operably 

10 linked to an isolated nucleic acid fragment encoding an R myc-type transcription 
factor, (ii) a recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding a C1 myb transcription factor 
and a promoter operably linked to an isolated nucleic acid fragment encoding an R 
myc-type transcription factor, or (iii) a recombinant expression construct comprising 

15 a promoter operably linked to an isolated nucleic acid fragment encoding all or part 
of a C1 myb transcription factor and all or part of an R myc-type wherein said 
construct is capable of functioning as both a C1 myb transcription factor and an R 
myc-type transcription factor; wherein said plant has an increased total daidzein to 
total genistein ratio when compared to the total daidzein to total genistein ratio of a 

20 control. 

Examples of isoflavonoid-producing plants include, but are not limited to, 
soybean, clover, mung bean, lentil, hairy vetch, alfalfa, lupine, sugar beet, and snow 
pea. In a more preferred embodiment, the preferred isoflavonoid-producing plant 
would be soybean. Examples of other isoflavonoid-producing plants can be found in 
25 WO 93/23069, published November 25, 1993, the disclosure of which is hereby 
incorporated by reference. 

Transformation methods are well known to those skilled in the art and are 
described above. 

The recombinant expression constructs which can be used to transform an 
30 isoflavonoid-producing plant fall into one of three categories: 

(1) the constructs can be entirely separate, e.g., one construct may comprise a 
promoter operably linked to an isolated nucleic acid fragment encoding a C1 myb 
transcription factor and another separate construct may comprise a promoter 
operably linked to an isolated nucleic acid fragment encoding an R-myc type 

35 transcription factor; 

(2) a single construct comprising a promoter operably linked to an isolated 
nucleic acid fragment encoding a C1 myb transcription factor and a promoter 
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operably linked to an isolated nucleic acid fragment encoding an R-myc type 
transcription factor; or 

(3) a single construct comprising a promoter operably linked to an isolated 
nucleic acid fragment encoding all or a part of a C1 myb transcription factor and an 
5 isolated nucleic acid fragment encoding all or a part of an R-myc type transcription 
factor such that a fusion protein combining the two encoded proteins is produced. 

The transformed plant is then grown under conditions suitable for the 
expression of the recombinant expression construct or constructs. Expression of 
the recombinant expression construct or constructs alters the isoflavonoid profile of 

10 the transformed plant or plant part by increasing the total daidzein to total genistein 
ratio compared to the total daidzein to total genistein ratio of an untransformed plant 
or plant part. For example, in some cases it may be preferrable to examine 
expression of a recombinant expression construct by comparing seeds obtained 
from a transformed plant with seeds obtained from an untransformed plant to 

15 determine if there has been an increase in the total daidzein to total genistein ratio. 
In a more preferred, embodiment, an isoflavonoid-producing plant can be 
transformed with a recombinant expression construct comprising a promoter 
operably linked to an isolated nucleic acid fragment encoding a chimeric 
transcription factor comprising the maize R coding region situated between the C1 

20 DNA binding domain and the C1 activation domain. 

The regeneration, development and cultivation of plants from single plant 
protoplast transformants or from various transformed explants is well known in the 
art (Weissbach and Weissbach, In.: Methods for Plant Molecular Biology, (Eds.), 
Academic Press, Inc., San Diego, CA (1988)). This regeneration and growth 

25 process typically includes the steps of selection of transformed cells, culturing those 
individualized cells through the usual stages of embryonic development through the 
rooted plantlet stage. Transgenic embryos and seeds are similarly regenerated. 
The resulting transgenic rooted shoots are thereafter planted in an appropriate plant 
growth medium such as soil. 

30 The development or regeneration of plants containing the foreign, exogenous 

gene that encodes a protein of interest is well known in the art. Preferably, the 
regenerated plants are self-pollinated to provide homozygous transgenic plants. 
Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown 
plants of agronomically important lines. Conversely, pollen from plants of these 

35 important lines is used to pollinate regenerated plants. A transgenic plant of the 
present invention containing a desired polypeptide is cultivated using methods well 
known to one skilled in the art. 
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There are a variety of methods for the regeneration of plants from plant tissue. 
The particular method of regeneration will depend on the starting plant tissue and 
the particular plant species to be regenerated. 

Methods for transforming dicots, primarily by use of Agrobacterium 
5 tumefaciens, and obtaining transgenic plants have been published for cotton (U.S. 
Patent No. 5,004,863, U.S. Patent No. 5,159,135, U.S. Patent No. 5,518, 908); 
soybean (U.S. Patent No. 5,569,834, U.S. Patent No. 5,416,011, McCabe et. al., 
BiolTechnology 6:923 (1988), Christou et al., Plant Physiol. 87:671-674 (1988)); 
Brassica (U.S. Patent No. 5,463,174); peanut (Cheng et al., Plant Cell Rep. 

10 15:653-657 (1996), McKently et al., Plant Cell Rep. 14:699-703 (1995)); papaya; 
and pea (Grant et al., Plant Cell Rep. 15:254-258, (1995)). 

Assays for gene expression based on the transient expression of cloned 
nucleic acid constructs have been developed by introducing the nucleic acid 
molecules into plant cells by polyethylene glycol treatment, electroporation, or 

15 particle bombardment (Marcotte et al., Nature 335:454-457 (1 988); Marcotte et al., 
Plant Cell 1 :523-532 (1989); McCarty etal., Cell 66:895-905 (1991); Hattori et al., 
Genes Dev. 6:609-618 (1992); Goff et al., EMBOJ. 9:2517-2522 (1990)). 

Transient expression systems may be used to functionally dissect gene 
constructs (see generally, Maliga et al., Methods in Plant Molecular Biology, Cold 

20 Spring Harbor Press (1 995)). It is understood that any of the nucleic acid molecules 
of the present invention can be introduced into a plant cell in a permanent or 
transient manner in combination with other genetic elements such as vectors, 
promoters, enhancers etc. 

In addition to the above discussed procedures, practitioners are familiar with 

25 the standard resource materials which describe specific conditions and procedures 
for the construction, manipulation and isolation of macromolecules (e.g., DNA 
molecules, plasmids, etc.), generation of recombinant DNA fragments and 
recombinant expression constructs and the screening and isolating of clones, (see 
for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 

30 Harbor Press (1989); Maliga et al., Methods in Plant Molecular Biology, Cold Spring 
Harbor Press (1995); Birren et al., Genome Analysis: Detecting Genes, 1, Cold 
Spring Harbor, New York (1998); Birren et al., Genome Analysis: Analyzing DNA, 2, 
Cold Spring Harbor, New York (1998); Plant Molecular Biology: A Laboratory 
Manual, eds. Clark, Springer, New York (1997)). 

35 Any promoter can be used in the method of the invention. Thus, the origin of 

the promoter chosen to drive expression of the coding sequence is not critical as 
along as it has sufficient transcriptional activity to accomplish the invention by 
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expressing translatable mRNA for the desired protein genes in the desired host 
tissue. In a preferred embodiment, the promoter is a seed-specific promoter. 
Examples of a seed-specific promoter include, but are not limited to, the promoter 
for p-conglycinin (Chen et al. (1989) Dev. Genet. 10: 112-122), the napin and 
5 phaseolin promoters. A plethora of promoters are described in WO 00/18963, 
published on April 6, 2000, the disclosure of which is hereby incorporated by 
reference. 

Also within the scope of this invention are seeds or plant parts obtained from 
such transformed plants. Plant parts include differentiated and undifferentiated 
10 tissues, including but not limited to, roots, stems, shoots, leaves, pollen, seeds, 

tumor tissue, and various forms of cells and culture such as single cells, protoplasts, 
embryos, and callus tissue. The plant tissue may be in plant or in organ, tissue or 
cell culture. 

In another aspect, this invention concerns an isoflavonoid-containing product 

15 high in total daidzein and low in total genistein obtained from the seeds or plant 
parts obtained from the transformed plants described herein. Examples of such an 
isoflavonoid-containing product include, but are not limited to, protein isolate, protein 
concentrate, meal, grits, full fat and defatted flours, textured proteins, textured 
flours, textured concentrates and textured isolates. In still another aspect, this 

20 invention concerns an isoflavonoid-containing product high in total daidzein and low 
in total genistein extracted from the seeds or plant parts obtained from the 
transformed plants described herein. An extracted product could then used in the 
production of pills, tablets, capsules or other similar dosage forms made to contain a 
high concentration of isoflavones. 

25 Methods for obtaining such products are well known to those skilled in the art. 

For example, in the case of soybean, such products can be obtained in a variety of 
ways. Conditions typically used to prepare soy protein isolates have been 
described by [Cho, et al, (1981) U.S. Patent No. 4,278,597; Goodnight, et al. (1978) 
U.S. Patent No. 4,072,670]. Soy protein concentrates are produced by three basic 

30 processes: acid leaching (at about pH 4.5), extraction with alcohol (about 55-80%), 
and denaturing the protein with moist heat prior to extraction with water. Conditions 
typically used to prepare soy protein concentrates have been described by Pass 
[(1975) U.S. Patent No. 3,897,574] and Campbell et al. [(1985) in New Protein 
Foods, ed. by Altschul and Wilcke, Academic Press, Vol. 5, Chapter 10, Seed 

35 Storage Proteins, pp 302-338]. 

"Isoflavone-containing protein products" can be defined as those items 
produced from seed of a suitable plant which are used in feeds, foods and/or 
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beverages. For example, "soy protein products"can include, but are not limited to, 
those items listed in Table 1. "Soy protein products". 



Soy Protein Products Derived from Soybean Seeds 3 
Whole Soybean Products Processed Soy Protein Products 

Roasted Soybeans Full Fat and Defatted Flours 

Baked Soybeans Soy Grits 

Soy Sprouts Soy Hypocotyls 

Soy Milk Soybean Meal 

Soy Milk 

Specialty Sov Foods/Ingredients Soy Protein Isolates 

Soy Milk Soy Protein Concentrates 

Tofu Textured Soy Proteins 

Tempeh Textured Flours and Concentrates 

Miso Textured Concentrates 

Soy Sauce Textured Isolates 

Hydrolyzed Vegetable Protein 

Whipping Protein 

a See Soy Protein Products: Characteristics, Nutritional Aspects and Utilization 
(1987). Soy Protein Council 

"Processing" refers to any physical and chemical methods used to obtain the 
products listed in Table 1 and includes, but is not limited to, heat conditioning, 
flaking and grinding, extrusion, solvent extraction, or aqueous soaking and 
extraction of whole or partial seeds. Furthermore, "processing" includes the 
methods used to concentrate and isolate soy protein from whole or partial seeds, as 
well as the various traditional Oriental methods in preparing fermented soy food 
products. Trading Standards and Specifications have been established for many of 
these products (see National Oilseed Processors Association Yearbook and Trading 
Rules 1991-1992). Products referred to as being "high protein" or "low protein" are 
those as described by these Standard Specifications. "NSI" refers to the Nitrogen 
Solubility Index as defined by the American Oil Chemists' Society Method Ac4 41. 
"KOH Nitrogen Solubility" is an indicator of soybean meal quality and refers to the 
amount of nitrogen soluble in 0.036 M KOH under the conditions as described by 
Araba and Dale [(1990) Poult. Sci. 69:76-83]. "White" flakes refer to flaked, 
dehulled cotyledons that have been defatted and treated with controlled moist heat 
to have an NSI of about 85 to 90. This term can also refer to a flour with a similar 
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NSI that has been ground to pass through a No. 100 U.S. Standard Screen size. 
"Cooked" refers to a soy protein product, typically a flour, with an NSI of about 20 to 
60. "Toasted" refers to a soy protein product, typically a flour, with an NSI below 20. 
"Grits" refer to defatted, dehulled cotyledons having a U.S. Standard screen size of 
5 between No. 10 and 80. "Soy Protein Concentrates" refer to those products 

produced from dehulled, defatted soybeans by three basic processes: acid leaching 
(at about pH 4.5), extraction with alcohol (about 55-80%), and denaturing the protein 
with moist heat prior to extraction with water. Conditions typically used to prepare 
soy protein concentrates have been described by Pass [(1975) U.S. Patent No. 

10 3,897,574; Campbell et al., (1 985) in New Protein Foods, ed. by Altschul and 

Wilcke, Academic Press, Vol. 5, Chapter 10, Seed Storage Proteins, pp 302-338]. 
"Extrusion" refers to processes whereby material (grits, flour or concentrate) is 
passed through a jacketed auger using high pressures and temperatures as a 
means of altering the texture of the material. "Texturing" and "structuring" refer to 

15 extrusion processes used to modify the physical characteristics of the material. The 
characteristics of these processes, including thermoplastic extrusion, have been 
described previously [Atkinson (1970) U.S. Patent No. 3,488,770, Horan (1985) In 
New Protein Foods, ed. by Altschul and Wilcke, Academic Press, Vol. 1 A, 
Chapter 8, pp 367-414], Moreover, conditions used during extrusion processing of 

20 complex foodstuff mixtures that include soy protein products have been described 
previously [Rokey (1983) Feed Manufacturing Technology III, 222-237; McCulloch, 
U.S. Patent No. 4,454,804]. 

Also, within the scope of this invention are food and beverages which have 
incorporated therein an isoflavonoid-containing product of the invention. 

25 The beverage can be a liquid or in a dry powdered form. 

The foods to which the isoflavonoid-containing product of the invention can be 
incorporated/added include almost all foods/beverages. For example, there can be 
mentioned meats such as ground meats, emulsified meats, marinated meats, and 
meats injected with an isoflavonoid-containing product of the invention; nutritional 

30 supplements; beverages such as nutritional beverages, sports beverages, protein 
fortified beverages, juices, milk, milk alternatives, and weight loss beverages; 
cheeses such as hard and soft cheeses, cream cheese, and cottage cheese; frozen 
desserts such as ice cream, ice milk, low fat frozen desserts, and non-dairy frozen 
desserts; yogurts; soups; puddings; bakery products; and salad dressings; and dips 

35 and spreads such as mayonnaise; and chip dips; and food bars. The 

isoflavonoid-containing product can be added in an amount selected to deliver a 
desired dose to the consumer of the food and/or beverage. 
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In still another aspect this invention concerns a method of producing an 
isoflavonoid-containing product which comprises: (a) cracking the seeds obtained 
from transformed plants of the invention to remove the meats from the hulls; and 
(b) flaking the meats obtained in step (a) to obtain the desired flake thickness. 

5 

EXAMPLES 

The present invention is further defined in the following Examples, in which 
parts and percentages are by weight and degrees are Celsius, unless otherwise 
stated. It should be understood that these Examples, while indicating preferred 

10 embodiments of the invention, are given by way of illustration only. From the above 
discussion and these Examples, one skilled in the art can ascertain the essential 
characteristics of this invention, and without departing from the spirit and scope 
thereof, can make various changes and modifications of the invention to adapt it to 
various usages and conditions. Thus, various modifications of the invention in 

15 addition to those shown and described herein will be apparent to those skilled in the 
art from the foregoing description. Such modifications are also intended to fall 
within the scope of the appended claims. 

EXAMPLE 1 

Construction of Plasmids for Transformation of Glycine max 
20 The effect on the isoflavonoid profile of soybean of a protein encoded by a 

recombinant DNA fragment containing maize nucleotide sequences encoding C1 
and the Lc allele of R was tested. For this purpose, plasmid pOY203 was 
constructed for introduction of a CRC recombinant expression construct into 
soybean embryos. Plasmid pOY203 was briefly described in PCT publication 
25 WO 00/44090 (published August 3, 2000) and contains a CRC recombinant DNA 
fragment under the control of the phaseolin promoter and termination signals in a 
vector containing expression systems which allow for selection for growth in the 
presence of hygromycin in both bacterial and plant systems. 

Plasmid pOY203 was prepared through an intermediary plasmid pOY135. 
30 Plasmid pOY135 contains, flanked by Hind INI restriction endonuclease sites, the 
CRC recombinant DNA fragment inserted between the phaseolin promoter and 
polyadenylation signal sequences. The CRC recombinant DNA fragment contains, 
between Sma I sites and in the 5' to 3' orientation, maize nucleotide sequences 
encoding 

35 (a) the C1 myb domain to amino acid 1 25; 

(b) the entire coding region of the Lc allele of R (amino acids 1 through 160); 

and 
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(c) the C1 transcription activation domain (from amino acid 126 to the 
C-terminus of C1). 

The CRC recombinant DNA fragment was isolated from plasmid pDP7951 
(described in PCT Publication WO 00/44090, published August 3, 2000, and bearing 
5 ATCC deposit No. PTA371 ) and inserted into vector pCW1 08N. Vector pCW108N 
is derived from the commercially-available vector pUC18 (Gibco-BRL) and contains 
between Hind III sites: 

(a) a DNA fragment of the phaseolin gene promoter extending from -410 to 
+77 relative to the transcription start site (Slightom et al. (1991) Plant Mol. Biol. Man. 

10 B16:1);and 

(b) a 1175 bp DNA fragment including the polyadenylation signal sequence 
region of the same phaseolin gene (see sequence descriptions in Doyle et al. (1986) 
J. Biol. Chem. 267:9228-9238 and Slightom et al. (1983) Proc. Natl. Acad. Sci. USA 
80:1897-1901). 

15 Plasmid pCW108N was digested with Asp 718, which cuts between the 

phaseolin promoter and polyadenylation signal sequence, and the protruding ends 
filled-in by incubation with T4 DNA polymerase in the presence of dATP, dCTP, 
dGTP, and dTTP. The DNA fragment containing the CRC recombinant DNA 
fragment was isolated from pDP7951 by digestion with Sma I, purified by agarose 

20 gel electrophoresis, and inserted into the blunt-ended pCW108N to create pOY135. 
To create pOY203, a cassette containing the phaseolin promoter/CRC 
recombinant DNA fragment/phaseolin polyadenylation signal sequence (herein 
referred to as CRC recombinant expression construct) was liberated from pOY135 
by digestion with Hind III and introduced into Hind Ill-digested pZBL102. Plasmid 

25 pZBL102 contains expression systems which allow for selection for growth in the 
presence of hygromycin to be used as a means of identifying cells that contain 
plasmid DNA sequences in both bacterial and plant systems and is described in 
PCT Publication WO 00/44090. 

Even though it is not necessary for the practice of the invention, in the original 

30 experiment, plasmid pOY203 was co-bombarded into soybean embryos with 
plasmid pWSJ001 also described in PCT publication WO 00/44090. Plasmid 
pWSJ001 contains the isoflavone synthase coding region under the contra! of the 
alpha' beta-conglycinin promoter and phaseolin polyadenylation signal sequence in 
a vector containing expression systems which allow for selection for growth in the 

35 presence of hygromycin in both bacterial and plant systems. The isoflavone 
synthase coding region (found in NCBI General Identifier No. 6979520) was 
obtained by PGR amplification of a clone (sgs1c.pk006.o20) obtained from a 
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soybean cDNA library prepared from seeds germinated for 4 hours. Amplification 
was performed using Pfu polymerase (Stratagene) in a standard PCR reaction in a 
GeneAmp PCR System with primer5 (shown in SEQ ID NO: 15) and primer6 (shown 
inSEQIDNO:16). 

5 5'-TTGCTGGAACTTGCACTTGGT-3' [SEQ ID NO: 15] 

5'-GTATATGATGGGTACCTTAATTAAGAAAGGAG-3' [SEQ ID NO: 1 6] 
The isoflavone synthase coding region was first inserted between the alpha' 
beta-conglycinin promoter and phaseolin polyadenylation signal sequence of vector 
pCW109. Vector pCW109 contains a 550 bp fragment of the alpha' 

10 beta-conglycinin promoter (Slightom et al. (1 991 ) Plant Mol. Biol. Man. B1 6:1 ) and 
the same phaseolin polyadenylation signal sequence described above for 
pCW108N. The Nco I site located between the promoter and polyadenylation signal 
sequence fragments in plasmid pCW109 was eliminated by digestion with Nco I 
followed by fill-in with T4 DNA polymerase in the presence of dATP; dCTP, dGTP 

15 and dTTP. The resulting DNA was digested with Kpn I, which cuts 3' of the filled-in 
Nco I site, and the isoflavone synthase fragment introduced. The cassette 
containing the IFS chimeric gene (alpha' beta-conglycinin promoter/isoflavone 
synthase/phaseolin 3' polyadenylation sequence) was liberated from this plasmid by 
digestion with Hind III and introduced into Hind Ill-digested pZBL102 to form 

20 pWSJOOL 

EXAMPLE 2 

Transformation of Somatic Soybean Embryo Cultures 
and Regeneration of Soybean Plants 
The ability to alter the isoflavone levels in transgenic soybean plants 
25 expressing the CRC recombinant expression construct was tested by transforming 
soybean somatic embryo cultures with plasmids pOY203 and pWSJ001 , screening 
for transformants expressing only the CRC recombinant expression construct, 
allowing plants to regenerate, and measuring the levels of isoflavones produced. 
The present invention does not require the presence of plasmid pWSJ001. 
30 Screening for the presence of the transgenes was performed by PCR amplification, 
and plants containing the isoflavone synthase recombinant-expression construct 
were excluded from this work. 

Soybean embryogenic suspension cultures were transformed with pOY203 in 
conjunction with pWSJ001 by the method of particle gun bombardment, and 
35 transformants carrying the CRC recombinant expression construct in pOY203, and 
not the IFS recombinant expression construct in pWSJOOl , were identified. 
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The following stock solutions and media were used for transformation and 
regeneration of soybean plants: 
Stock Solutions (per Liter): 

MS Sulfate 100x stock: 37.0 g MgSC>4.7H20, 1.69 g MnSC^.h^O, 0.86 g 
5 ZnS0 4 .7H 2 0, 0.0025 g CUSO4.5H2O. 

MS Halides 100x stock: 44.0 g CaCI 2 .2H20, 0.083 g Kl, 0.00125 g 
CoCI 2 .6H 2 0, 17.0 g KH 2 P0 4j 0.62 g H 3 B0 3j 0.025 g Na 2 Mo0 4 .2H20, 3.724 g 
Na 2 EDTA, 2.784 g FeS04.7H 2 0. 

B5 Vitamin stock: 100.0 g myo-inositol, 1.0 g nicotinic acid, 1.0 g pyridoxine 
10 HCI, 10.0 g thiamine. 

2,4-D stock: 10 mg/mL 
Media (per Liter): 

SB55: 10 mL of each MS stock, 1 ml_ of B5 Vitamin stock, 0.8 g NH4N0 3j 
3.033 g KN0 3i 1 mL 2,4-D stock, 0.667 g asparagine, pH 5.7. 
15 SB103: 1 pk. Murashige & Skoog salt mixture (Gibco BRL), 60 g maltose, 2 g 

gelrite, pH 5.7. 

SB71-1: B5 salts, 1mL B5 vitamin stock, 30 g sucrose, 750 mg MgCI2, 2 g 
gelrite, pH 5.7. 

Soybean (of the Jack variety) embryogenic suspension cultures were 
20 maintained in 35 mL SB55 liquid media on a rotary shaker (150 rpm) at 28°C with a 

mix of fluorescent and incandescent lights providing a 16 hour day, 8 hour night 

cycle. Cultures were subcultured every 2 to 3 weeks by inoculating approximately 

35 mg of tissue into 35 mL of fresh liquid media. 

Soybean embryonic suspension cultures were transformed by the method of 
25 particle gun bombardment (see Klein et al. (1 987) Nature 327:70-73) using a 

DuPont Biolistic PDS1000/He instrument. Embryos were co-bombarded with 

plasmid pOY203 (containing the CRC recombinant expression construct) and 

plasmid pWSJ001 (containing the IFS recombinant expression construct). 

Transformants containing the CRC recombinant expression construct alone were 
30 identified by PCR and are described herein. Transformants containing the IFS 

recombinant expression construct were used for other purposes and do not form 

part of the present invention. 

For bombardment, 5 jxL of a 1:2 mixture of pOY203 (0.5 ng/jiL) and pWSJ001 

(1 ng/uL) plasmid DNA, 50 j*L CaCI 2 (2.5 M), and 20 uL spermidine (0.1 M) were 
35 added to 50 pL of a 60 mg/mL 0.6 (am gold particle suspension. The particle 

preparation was agitated for 3 minutes, spun in a microfuge for 10 seconds and the 
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supernatant removed. The DNA-coated gold particles were then washed once with 
400 jjI of 100% ethanol, resuspended in 40 jaL of anhydrous ethanol, and sonicated 
three times for 1 second each. Five of the DNA-coated gold particles was then 
loaded on each macro carrier disk. 
5 Approximately 300 to 400 mg of two-week-old suspension culture was placed 

in an empty 60 mm X 15 mm petri dish and the residual liquid removed from the 
tissue using a pipette. The tissue was placed about 3.5 inches away from the 
retaining screen and bombarded twice. Membrane rupture pressure was set at 
1 100 psi and the chamber was evacuated to -28 inches of Hg. Two plates were 

10 bombarded for each experiment and, following bombardment, the tissue was 
divided in half, placed back into liquid media, and cultured as described above. 

Eleven days after bombardment, the liquid media was exchanged with fresh 
SB55 media containing 50 mg/mL hygromycin. The selective media was refreshed 
weekly. Seven weeks post bombardment, green, transformed tissue was observed 

15 growing from untransformed, necrotic embryogenic clusters. Isolated green tissue 
was removed and inoculated into individual flasks to generate new, clonally 
propagated, transformed embryogenic suspension cultures. Thus, each new line 
was treated as an independent transformation event. Soybean suspension cultures 
can be maintained as suspensions of embryos clustered in an immature 

20 developmental stage through subculture or can be regenerated into whole plants by 
maturation and germination of individual somatic embryos. 

Transformed embryogenic clusters were removed from liquid culture and 
placed on SB 103 solid agar media containing no hormones or antibiotics. Embryos 
were cultured for eight weeks at 26°C with mixed fluorescent and incandescent 

25 lights on a 16 hour day, 8 hour night schedule. During this period, individual 

embryos were removed from the clusters and analyzed at various stages of embryo 
development. Selected lines were assayed by PCR amplification for the presence 
of the CRC recombinant expression construct and/or the IFS recombinant 
expression construct. 

30 5'- AGGCGGAAGAACTGCTGCAACG -3' [SEQ ID NO.i] 

5'- AGGTCCATTTCGTCGCAGAGGC -3' [SEQ ID NO:2] 

5-ATGTTTG G CAAGTAG G AAG G G ACC -3' [SEQ ID NO:3] 

5'-GCATTCCATAAGCCGTCACGATTC -3' [SEQ ID NO:4] 

The presence of the CRC recombinant expression construct was determined 
35 using primerl and primer2 (shown in SEQ ID NO:1 and SEQ ID NO:2, respectively) 
which produce a fragment that is not present in wild type soybean embryos. The 
presence of the IFS recombinant expression construct was determined using 
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primer3 and primer4 (shown in SEQ ID NO:3 and SEQ ID NO:4, respectively). 
Separation, on an agarose gel, of the amplification products obtained with this pair 
of primers yielded a 1062 bp fragment indicative of the endogenous IFS gene (i.e., 
containing introns) in all samples and an 845 bp fragment in the embryos also 
5 containing the IFS recombinant expression construct. Embryos containing the CRC 
recombinant expression construct and not the IFS recombinant expression construct 
were selected for further study. 

Somatic embryos became suitable for germination after eight weeks and were 
then removed from the maturation medium and dried in empty petri dishes for 1 to 
10 5 days. The dried embryos were then planted in SB71 -1 medium where they were 
allowed to germinate under the same lighting and germination conditions described 
above. Germinated embryos were transferred to sterile soil and grown to maturity. 
Seeds were harvested. 

EXAMPLE 3 

15 Analysis of Isoflavones in R1 Seed of Transformants 

Containing the CRC Recombinant expression construct 
Isoflavone levels were analyzed in seed from soybean primary transformants 
(R1 seed) containing the CRC recombinant expression construct and not the IFS 
recombinant expression construct. Extracts were prepared and analyzed by HPLC 

20 as follows. Each seed was weighed and placed in a 2 ml_ screw cap tube 

containing a %" cylindrical bead and 20 mg flavone (as internal standard). The seed 
was then crushed using a bead beater at 4200 rpm for 30 second intervals until 
reduced to a fine powder. The sample was homogenized into solution by the 
addition of 800 jiLof 80% aqueous methanol and further bead beating. Each 

25 sample was left in a shaking water bath at 60°C for 4 hours and then centrifuged at 
12000 rpm for 10 minutes. A 100 |nL aliquot of the supernatant was removed and 
added to 100 jxL deionized water, vortexed, centrifuged, and analyzed by HPLC. An 
HP 1 100 instrument equipped with a diode array detector and a Phenomenex, Luna 
3 C18(2), 4.6 mm x 150 mm column was used for HPLC analysis. The column 

30 temperature was 22°C, the solvent flow rate was 1 mL/min, and the detection was 
performed at 260 nm. The solvent elution consisted of a gradient from 5% 
methanol/ 95% 0.1% trifluoroacetic acid (TFA) in water to 100% methanol over 
16 minutes followed by a 3 minute post-run wash. This resulted in chromatograms 
depicting daidzein, glycitein, genistein, and their conjugate derivatives. Standard 

35 curves were constructed with each analysis and individual compounds were 
measured. All of the conjugates were converted to aglycone equivalent values 
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using standard conversion factors. In addition to total concentrations for each 
aglycone, the total isoflavone content was also calculated. 

Generally, five individual seeds from each of one to three plants from each 
transformation event were analyzed. Seeds from primary transformant plants from 
5 a total of 1 3 transformation events, each carrying only the CRC recombinant 
expression construct, were analyzed as well as seed from primary transformant 
plants not carrying a transgene. A subset of the CRC events showed an altered 
isoflavonoid composition as compared to the controls. Observing a phenotype in a 
portion of transgenic plants is explained by the usual variation of expression of a 

10 transgene that occurs in independent transformation events. 

The isoflavone component profile for seeds from a control plant is shown in 
Figure 10, seeds #1-5. This control plant came from a transformation experiment 
but it was PCR negative for the CRC recombinant expression construct. In this 
typical control profile, genistein is the most abundant of the isoflavones. Daidzein is 

15 generally the next highest level component with glycitein lowest, although in some 
seeds the daidzein and glycitein levels can be similar. This example also shows a 
substantial amount of variation in the levels of the individual isoflavones, as well as 
the sum of all isoflavone levels, among individual seeds from the same plant. An 
obvious change in the isoflavone component profile could be seen in seeds 

20 obtained from plants representing four independent transformation events (Figure 
10). The R1 seeds from the hemizygous primary transformants would be expected 
to be segregating for the transgene. Among the seeds analyzed from the 1-1 , 1-2, 
and 1 -35 event plants there are seeds with an altered profile as well as seeds with 
the control profile described above. All of the seeds from the 1 -25 event had an 

25 altered profile indicating that all five seeds contain the transgene. This could be due 
to the presence of multiple segregating loci or due to the selection by chance of five 
single seeds, each containing a single locus. 

The glycitein levels were the least affected in these seeds with altered 
isoflavone components. However in some seeds, particularly of the 1-1 event, the 

30 glycitein levels were increased about two-fold above the level in wild type segregant 
seeds of the same transformant (Figure 10, seeds # 6, 10, 13). The pathway for 
glycitein synthesis is not defined, but may be a part of the daidzein branch due to 
more similarity in glycitein structure with daidzein than with genistein. An enzyme 
encoded by the CYP71D9 P450 that may be involved in glycitein synthesis was 

35 recently characterized (Latunde-Dada et al. (2001) J. Biol. Chem. 276: 1688-1695). 
If daidzein and glycitein are closely related, the CRC transgene has the effect of 
activating the daidzein/glycitein branch of isoflavone synthesis. In some individual 
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seeds from CRC transformants having high daidzein levels the total isoflavone 
levels were increased. Out of the 16 seeds with altered isoflavone profiles (shown 
in Figure 10), 14 seeds had higher total isoflavone levels than the seeds from the 
control plant. It may be concluded that the total isoflavone level in individual seeds 
5 is quite variable, but CRC can, in some cases, raise the level further. The 

inconsistency of this effect suggests that there must be other factors that contribute 
to establishing the final total isoflavone levels. 

The altered isoflavone profile of seeds in these four events is distinguished by 
greatly increased levels of total daidzein, the highest level being raised about 

10 four-fold when compared to the daidzein levels in control and wild type segregating 
seeds (seeds #10 and 17). The same individual seeds with high levels of daidzein 
also had greatly decreased levels of genistein, in some instances decreased to 
almost undetectable levels (seeds # 6, 11, 13, for example). These changes result 
in the daidzein component contributing 60% to 80% of the total isoflavones in the 

15 altered phenotype seeds, while daidzein is generally 20% to 35% of the total in 

control and wild type segregating seeds (Figure 11). In the altered phenotype seeds 
the genistein component ranges from a low of almost 0% up to 14%, while in control 
and wild type segregating seeds the range is between 43% and 60% of the total 
isoflavones (Figure 12). The reduction in genistein level varied between the 

20 different transformation events, with event 1-2 having the greatest genistein 

reduction, almost to zero. In event 1-25 genistein was only reduced to between 6% 
and 14% of total isoflavones, this is still much below control levels. 

Figure 2 shows the total daidzein to total genistein ratios for individual seeds 
obtained from plants from the 1-1, 1-2, 1-25, and 1-35 transformation events having 

25 an increased total daidzein to total genistein ratio as well as from control seeds. 
The control seeds are obtained either from plants resulting from transformation 
experiments that, during PCR amplification, were negative for the CRC recombinant 
expression construct, or from plants transformed with a recombinant DNA 
expression construct that does not alter the isoflavonoid profile. The ratios for two 

30 seeds obtained from the 1 -2 event are not shown because their ratios are too high 
(784.0 and 801 .0) to plot on the same chart. While the total daidzein to total 
genistein ratios for control seeds ranged between 0.3 and 1.6, the ratios for the 
seeds from the four transformation events with the novel high total daidzein 
phenotype ranged between 4.7 and 801.0. The exact ratio of total daidzein to total 

35 genistein was variable between individual seeds, even within a single transformation 
event. However, it is clear that expression of the CRC recombinant expression 
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construct in soybean seeds altered the total daidzein to total genistein ratio from 
being less than 2, to being over 4.5. 

Most of the seeds from the four transformation events having increased total 
daidzein to total genistein ratios also had increased levels of total isoflavones. 
Figure 3 depicts a graph showing the total isoflavone levels for the same seeds as 
the total daidzein to total genistein ratios are shown for in Figure 2. Of 24 seeds 
analyzed that showed high total daidzein to total genistein ratios, 18 had total 
isoflavone levels higher than the highest total isoflavone values for the controls. 
The control seeds had total isoflavonee levels ranging from 199 to 1833 |_ig/g seed 
weight. Eighteen of the seeds showing increased total daidzein to total genistein 
ratios had total isoflavone levels between 2003 and 4737, while the remaining six 
seeds showing increased total daidzein to total genistein ratios had total isoflavone 
levels between 348 and 1808. Thus, expression of the CRC recombinant 
expression construct in soybean seeds produced higher levels of total isoflavones in 
a majority of the seeds having high total daidzein to total genistein ratios. 

EXAMPLE 4 

Analysis of Isoflavones in R2 Seed of Transformants 
Containing the CRC Recombinant expression construct 

R1 seeds from the 1-1, 1-2, 1-25 and 1-35 events described above were 
planted in the field at the Stine location in Newark, DE and R1 seeds from the 1-1 
event were also planted in pots and grown in a growth room. Seeds were harvested 
(R2 seed) and analyzed for isoflavone levels. Single seed extracts were prepared 
and analyzed as described in Example 3 with the following modifications. No 
internal standard was added. Samples were extracted in 80% methanol for 1 hour 
at 27°C. After centrifugation, 500 juL of supernatant was transferred to a fresh 2 mL 
tube. An additional 500 yL of 80% methanol was added to the ground seed left in 
the tube, the mix was resuspended for 30 seconds using a Spex 2000 Geno-grinder 
at 1620 strokes/min, and the centrifugation repeated. Another 500 jj.L of 
supernatant was combined with the 500 juL in the fresh tube, the sample vortexed, 
centrifuged again, and 300 added to 300 juL of deionized H2O and vortexed. 
The sample was assayed by HPLC under the conditions of Example 3 except that 
the column temperature was 25°C, and the detection was at 262 nm. The data was 
calculated as described in Example 3. 

It was noted that individual seeds with the high total daidzein to total genistein 
ratio also had a brown stripe along the median of the seed. These seeds had a dark 
brown stripe around the median on the side opposite to the hilum, parallel to the 
cotyledon axis, as opposed to the overall light tan of seeds having a control 
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phenotype. Some of the brown striped seeds were smaller than control seeds and 
some were slightly wrinkled. Cutting the seeds showed that the brown pigmentation 
was only on the external coat and did not extend into the cotyledons. 

To further investigate a possible correlation between the visual phenotype and 
5 isoflavone profile, plants were grown in the growth chamber from either tan or brown 
striped R1 seeds from the 1-1, 1-2, and 1-25 events and the harvested R2 seeds 
observed. All plants grown from tan seeds produced only tan seeds. The 1-25 
plant grown from a brown stripe seed produced 17 brown striped seeds, consistent 
with there being multiple loci in this line. The 1-1 and 1-2 plants grown from brown 

10 striped seeds all produced segregating brown striped and tan seeds. For the 1-2 
line the segregation was 3:1 brown striped to tan, indicating a dominant trait. For 
the 1-1 line, the segregation ratio was 2:1, suggesting either lower penetrance of the 
trait or a possible association with a recessive seed-lethal phenotype. Isoflavone 
levels were analyzed in individual brown striped and tan seeds from the 1-1 and 1-2 

15 events, as well as brown stripe seeds from the 1-25 event. In every case, the brown 
striped seeds had high daidzein and low genistein, while the tan seeds had the high 
genistein control profile (Figure 13 and Figure 14, respectively). Thus the brown 
stripe cosegregates with an altered isoflavone phenotype in seeds obtained from 
CRC transformants. This visual phenotype provided a means of identifying CRC 

20 homozygotes as well as wild type segregants. 

Thus, seeds with the high total daidzein to total genistein trait could be 
identified visually before analysis. Plants that were wild type segregants from the 
CRC transformation event lines were identified as those plants producing only 
seeds without the brown stripe and the controls for the field-grown R2 seeds were 

25 obtained from these plants. 

The total daidzein to total genistein ratio for single R2 seeds from field-grown 
plants: either plants with no seeds with a brown stripe (wild type segregants) or 
plants with seeds segregating for the brown stripe are shown in Figure 4. The 
transgenic plants expressing the CRC recombinant DNA construct were segregating 

30 for the phenotype but only data for seeds with a brown stripe are shown. The total 
daidzein to total genistein ratios in the wild type segregants were between 0.6 and 
0.7 while the total daidzein to total genistein ratios in seeds having a brown stripe 
along the median of the seed ranged between 2.9 and 128.0. Of the 18 seeds 
having a brown stripe along the median that were analyzed, 16 had total daidzein to 

35 total genistein ratios equal to or greater than 20, while the other two seed had ratios 
of 2.9 and 4.5. Clearly, the high total daidzein to total genistein ratio was inherited 



33 



WO 02/101023 



PCT/US02/21107 



in second generation plants of the 1-1, 1-2, and 1-25 events as demonstrated by the 
isoflavone component levels in the R2 field-grown seeds. 

The control seeds for plants grown in the growth room was seed lacking the 
brown stripe along the median and harvested from the same plant as the seed 
5 having the brown stripe along the median of the seed. The total daidzein to total 
genistein ratios for the R2 seeds from growth room plants was obtained and is 
shown in Figure 5. For the R2 seeds from plants of the 1-1 event grown in the 
growth-room the total daidzein to total genistein ratios were much higher than the 
ratios for control seeds. The control seeds had total daidzein to total genistein ratios 

10 between 0.5 and 0.6 while seeds with the brown stripe down the median had total 
daidzein to total genistein ratios between 13.6 and 64.4. 

The total isoflavone levels in the R2 seeds from field grown plants were 
measured and are summarized in Figure 6. While some seeds containing the 
brown stripe along the median had total isoflavone levels about two times that of 

15 seeds not having the brown stripe, some of this brown stripe seeds had lower total 
isoflavone levels than the wild type segregant seeds without the brown stripe. Of 
the seeds from plants resulting from the 1-1 transformation event, all of the seeds 
with the brown stripe along the median had total isoflavone levels greater than the 
wild type segregant seeds. Of the seeds from plants resulting from the 1-25 

20 transformation event, the total isoflavone levels of the seeds with the brown stripe 
along the median were greater than the wild type segregant for all but one of the 
seeds analyzed. Of the seeds from plants resulting from the 1-2 transformation 
event, the total isoflavone levels for one of the wild type segregant seeds was higher 
than the usual control range. The total isoflavone level for this seed was higher than 

25 the total isoflavone levels for all of the seed from the 1-2 transformation event 

having the brown stripe along the median. However, all but one of the seeds from 
the 1-2 transformation event and having the brown stripe along the median had total 
isoflavone levels greater than the rest of the wild type segregant seeds (for the 1-1, 
1-25, and 1-2 events). 

30 The total isoflavone levels in the R2 seeds from plants of the 1 -1 

transformation event grown in the growth-room are shown in Figure 7. Seeds 
having the brown stripe along the median had higher total isoflavonw levels than the 
control seeds. 

As shown in Figure 15, the field grown brown striped R2 seeds from all three 
35 events had high daidzein levels, and in general had much reduced genistein, with 
levels around 2%. Even the 1-25 event, which had the least reduced genistein in 
the R1 seed, showed a greater genistein reduction in field grown seeds (Figure 16). 
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Thus variations in the extent of genistein reduction occurred between generations 
and environments. Two individual seeds, one from the 1-1 event and one from the 
1-25 event, are notable in having about 15%-17% genistein. This shows that there 
are also variables that affect the genistein levels even in individual seeds from the 
5 same plant. However, overall R2 seeds having the CRC transgene continued to 
show increased daidzein levels as well as the reduced genistein levels. Also the 
total isoflavone level was increased in some seeds, but again not consistently 
(Figure 17). 

In summary, the total isoflavone levels of second generation seeds were 
10 higher in most instances for seeds having a brown stripe along the median 

(indicative of the higher total daidzein to total genistein ratio and of the presence of 
the CRC recombinant expression construct) than control wild type segregant seeds. 
EXAMPLE 5 

Analysis of Isoflavones in R3 Seed of Transformants 

15 Containing the CRC Recombinant Expression Construct 

Plants were grown in the growth room from R2 seeds harvested from growth 
room grown plants from the 1-1, 1-2, and 1-25 transformation events and seeds 
were harvested (R3) and analyzed for isoflavone content. Extracts were prepared 
and analyzed in bulk samples as follows. Eight seeds from each plant were 

20 combined and ground in a non-commercial grinder. A 200 mg sample was weighed 
and transferred to a 2 mL vial. The sample was then prepared and assayed as 
described in Example 4. The controls for this experiment were R3 seeds from wild 
type segregants producing only non-brown striped seeds. For each transformation 
event one sample was analyzed from each of one control plant and three plants 

25 containing the CRC recombinant expression construct and the results are shown in 
Figure 8. The total daidzein to total genistein ratios in the wild type segregant bulk 
seed samples ranged between 0.7 and 0.8. The total daidzein to total genistein 
ratios in the samples from plants having the CRC recombinant expression construct 
ranged between 5.3 and 71.8. Clearly, the high total daidzein to total genistein ratio 

30 was inherited in third generation plants of the 1-1, 1-2, and 1-25 events as 
demonstrated by the isoflavone component levels in the R3 seeds. 

The total isoflavone levels in the bulk R3 samples are shown in Figure 9. In 
the R3 seeds, all bulk seed samples from the plants having the CRC recombinant 
expression construct had total isoflavone levels greater than all of the control plant 

35 seed samples. 
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EXAMPLE 6 

Analysis of the Expression of Genes of the Phenylpropanoid Pathway 
in R4 Seeds of Transformants Containing the 
CRC Recombinant Expression Construct 
5 Northern blot and immunoblot analyses were performed to determine the 

genes in the phenylpropanoid pathway affected by the expression of the CRC 
recombinant expression construct. Probes were prepared to detect mRNA from 
phenylalanine ammonia lyase (PAL), cinnamic acid 4-hydroxylase (C4H), chalcone 
isomerase (CHI), chalcone reductase (CHR), isoflavone synthase (IFS), flavanone 

10 3-hydroxylase (F3H), dihydroflavonol reductase (DFR), flavonol synthase (FS), and 
isoflavone reductase (IFR). RNA was prepared from seed of R3 plants containing 
the CRC recombinant expression construct or from controls, transferred to a 
membrane and hybridized with the probes mentioned above. These Northern blot 
analyses indicated that the levels of PAL, C4H, CHI, CHR, F3H, DFR, and FS were 

15 increased in the seed of transgenic plants expressing the CRC recombinant 

expression construct compared to controls. Immunoblot analyses were performed 
on protein samples derived from seed of the same plants, using anti-CHS, 
anti-CHR, or anti-IFS antisera. The protein expression profiles of CHR and IFS 
genes correlated with their RNA expression profiles. The CHS protein was 

20 increased in seed of CRC transgenic plants, suggesting higher expression of the 
CHS gene. 

Northern Blot analyses 

R3 generation plants of the 1-1 event were grown in the growth chamber. 
Plants homozygous for the CRC recombinant expression construct, producing only 

25 brown-striped seeds, and wild type segregants, producing only tan seeds, were 
grown. Immature seeds were harvested at two stages of development, at 
approximately 10-d ays-after flowering and 20-days-after flowering weighing 
approximately 150 mg and 250 mg, respectively. Total RNA and protein from these 
materials were extracted separately. For RNA extraction, a modified Trizol method 

30 (Gibco BRL, Life Technologies, Rockville, MD) was applied. Approximately 5 seeds 
for each sample were ground together in liquid nitrogen and 500 mg of the powder 
were extracted with 7.5 mL of Trizol reagent for 5 min. Three mL of chloroform was 
added, mixed, and the 4-ml aqueous phase was collected. The RNA was 
precipitated by the addition of 4 mL of iso-amyl alcohol. After centrifugation and 

35 removal of the liquid phase, the RNA precipitate was washed with 75% ethanol and 
air-dried for 20 min. The RNA was resuspended in 400 uL of water and from each 
sample, an amount equivalent to 30 |ig of RNA was loaded in each lane of a precast 
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Rilant RNA Gel (FMC, Rockland, ME). The RNA components were separated by 
electrophoresis and transferred to a membrane following standard protocols for 
RNA separation and Northern blotting (Sambrook). 

Probes were prepared from clones identified in the DuPont EST proprietary 
database as encoding the desired genes. The sequence of the entire cDNA insert 
in each chosen clone (except srr1c.pk001.k4) was obtained to verify that the insert 
represented the correct gene. The clones used to prepare the probes are shown in 
Table 2 together with the name of the encoded polypeptide and the corresponding 
identifier (SEQ ID NO:) as used in the attached Sequence Listing. 

TABLE 2 

Clones Used in the Preparation of Probes for the 
Detection of RNA from Genes of the Phenylpropanoid Pathway 



Clone Encoded Polypeptide SEQ ID NO: 



Sdp3c.pk002.c22 


PAL (phenylalanine ammonia lyase) 


5 


src3c.pk014.e17 


C4H (cinnamic acid 4-hydroxylase) 


6 


ssm.pk0013.e3* 


CHI (chalcone isomerase) 


7 


src3c.pk009.e4 


CHR (chalcone reductase) 


8 


POY204* 


IFS (isoflavone synthase) 


9 


sfllpk0040.g11* 


F3H (flavanone 3-hydroxylase) 


10 


sfl1.pk131.g5** 


DFR (dihydroflavonol reductase) 


11 


sre.pk0043.d11** 


DFR (dihydroflavonol reductase) 


12 


ssl.pk0057.d12 


FS (flavonol synthase) 


13 


srr1c.pk001.k4 


IFR (isoflavone reductase) 


14 



Some of these clones have been described in other patent applications. For example, clone 
ssm.pk0013.e3 is described in U.S. Patent No. 6,054,636; clone sfl1.pk0040.g11 is described in 
PCT publication No. WO 99/43,825, and clone pOY204 is described in PCT publication No. 
WO 00/44,909. 



** Both clones were used together to prepare the probe. 

Probes were prepared by the random primer method using the Random 
Primers DNA Labeling System from GIBCO-BRL, Life Technologies according to 
the manufacturer's protocol. The entire plasmid was used as template for all 
probes, except for IFS, where the template was a PCR product containing the IFS 
coding region. This PCR amplification product was obtained as described in 
Example 1, above, for the preparation of WSJ001. 

The entire random primer reaction mixture, without purification, was used for 
hybridization. Hybridization conditions were based on a protocol from PerfectHyb 



37 



WO 02/101023 



PCT/US02/21107 



Buffer (Sigma-Aldrich, St. Louis, MO). Hybridizations were carried out overnight at 
68°C. The membranes were then washed twice with 2 X SSC buffer (GIBCO BRL, 
Life Technologies) and once with 0.1 X SSC for 15 minutes each at 68°C. 
Immunoblot analyses 

Antibodies to CHS and CHR were prepared by Covance (Richmond, CA) to 
protein purified from E. coli expressing the CHS or CHR coding region using 
standard methods. The IFS antibody was prepared to synthetic peptides of the IFS 
protein as described in WO 00/44,909. Standard protocols were used for 
immunoblot analyses with anti-CHS, anti-CHR, or anti-IFS antisera. The 
Supersignal West Pico chemiluminescent substrate (Pierce, Rockford, IL) was used 
for visualization of the bound antibodies for CHS and CHR, while the Femto 
chemiluminescent substrate (Pierce, Rockford, IL) was used for IFS. 

Table 3 shows the relative detection of the RNA and/or protein of the different 
genes in the isoflavonoid pathway in either seeds from wild type segregant control 
plants or seeds from the CRC recombinant expression construct plants, harvested 
at 150 mg or 250 mg. One plus sign (+) indicates that the RNA or protein is clearly 
detected; +/- indicates that the RNA or protein is barely detected; and more than 
one plus sign indicates the approximate increase in detection of the particular RNA 
or protein levels. 

TABLE 3 

Levels of Expression of Phenvlpropanoid Pathway Genes in 





RNA Level 


Protein Level 




150 mg 


250 mg 


150 mg 


250 mg 


Gene 


WT 


CRC 


WT 


CRC 


WT 


CRC 


WT 


CRC 


PAL 


+ 


+++++ 


+ 


+++++ 


nd* 


nd 


nd 


nd 


C4H 


+ 


++++ 


+ 


++++ 


nd 


nd 


nd 


nd 


CHS 


nd 


nd 


nd 


nd 


+ 


+++++ 


+ 


+++++ 


CHI 


+ 


+++ 


+ 


+++ 


nd 


nd 


nd 


nd 


CHR 


+ 


+++ 


+ 


+++ 


+ 


++ 


+ 


++ 


F3H 


+ 


++++ 


+ 


++++ 


nd 


nd 


nd 


nd 


DFR 


+ 


++++ 


+/- 


++++ 


nd 


nd 


nd 


nd 


FS 


+/- 


+++ 


+/- 


+++ 


nd 


nd 


nd 


nd 


IFS 


++ 


++ 


++ 


++ 


+ 


+ 


+ 


+ 


IFR 


+/- 


+/- 


+/- 


+ 


nd 


nd 


nd 


nd 



*not determined 
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These results indicate that expression of particular soybean genes of the 
phenylpropanoid pathway, as listed above, was increased when the CRC 
recombinant expression construct was expressed in soybean seed. In the upper 
phenylpropanoid pathway the most dramatic changes were observed in expression 
5 of the PAL and CHS genes. Expression of C4H, CHI, and CHR was also increased 
significantly. 

Expression of IFS was not increased. IFR, an enzyme involved in the synthesis 
of glyceollins from daidzein, was not increased in the younger seed and had a slight 
increase in the older seed. 

10 Expression of some genes encoding enzymes involved in the 

flavonol/anthocyanin branch of the phenylpropanoid pathway was increased by 
CRC expression. These include F3H, DFR, and FS. 

It was determined that soybean seed expressing the CRC recombinant 
expression construct present a brown stripe along the median making them easy to 

15 identify. From the analysis of R1 , R2, and R3 seed it was determined that the levels 
of total isoflavones and total daidzein to total genistein ratios vary both in control 
seed and in seed containing the CRC recombinant expression construct. 

Overall the total daidzein to total genistein ratios for seed containing the CRC 
recombinant expression construct ranged between 2.9 and 801.0 and for samples 

20 from control seed ranged between 0.3 and 1.6. There is no overlap in these ranges. 
Of the seed examined, the total isoflavone levels were higher in the R1 seed 
from plants expressing the CRC recombinant expression construct than in plants not 
expressing the CRC recombinant expression construct. With two exceptions the 
total isoflavone levels of R2 seed obtained from field-grown plants were higher in 

25 seed from plants expressing the CRC recombinant expression construct compared 
to seed from plants not expressing the recombinant expression construct. In this 
instance there were two outliers, one seed from the 1-25 transformation event 
containing the CRC recombinant expression construct had lower total isoflavone 
levels than seed from the wt-segregants, and one seed from a wt-segregant of the 

30 1-2 transformation event had unusually high total isoflavone levels. All R3 seed 
examined containing the brown stripe along the median had higher total isoflavone 
levels than seed from wt-segregants. 
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EXAMPLE 7 

Identification of Intermediates of the Phenvlpropanoid Pathway 
that Accumulate in Transformants 
Containing the CRC Recombinant expression construct 
5 Mass spectroscopy was used to determine the differences in HPLC profiles 

between soybean seeds expressing the CRC recombinant DNA fragment and 
control seeds. Using mass spectroscopy three compounds were identified that are 
almost undetectable in wild type seed but present in seed expressing the CRC 
recombinant expression construct. Each of the additionally identified compounds 

10 has an m/z of 505 but differ in retention times of 15.46, 21.29, and 21.75 min 

(compare Figure 18, from wild type seed, and Figure 19 from seed expressing the 
CRC recombinant expression construct). MS2 analysis produced one major 
fragment with an m/z of 257 for each of the compounds. This mass indicates the 
loss of a fragment with a mass of 248, which is consistent with fragmentation of a 

15 malonyl-glucose from a conjugated compound. 

Liquiritigenin and isoliquiritigenin, intermediates in daidzein synthesis, both 
have a mass of 256 matching the m/z of 257 detected for each of the unknown 
peaks. The unknowns were further analyzed by first using in-source fragmentation 
(source collision induced dissociation) to remove the 248 m/z fragment leaving the 

20 257 m/z species, followed by MS2. The initial fragmentation was done under 

conditions determined to be ideal for removal of the malonyl-glucose moiety from 
the malonyl-glucose derivatives of daidzein and genistein. MS2 produced the same 
fragments of 239, 147, and 137 for each of the three unknowns. Analysis of 
liquiritigenin and isoliquiritigenin standards showed MS1 spectra with a major peak 

25 of m/z 257 and MS2 fragments of 239, 147, and 137 for each compound. These 
results suggest that the three unknowns are malonyl-glucose derivatives of 
liquiritigenin and/or isoliquiritigenin. 

Further characterization of the liquiritigenin and isoliquiritigenin standards 
showed that the UV spectra and retention times could be used to distinguish the two 

30 compounds. The UV spectrum of liquiritigenin matched that of the unknown with the 
1 5.5 retention time, while the spectra of the unknowns at 21 .3 and 21 .8 both are 
similar to the isoliquiritigenin UV spectrum (data not shown). The retention times of 
the unknowns, when compared to the 18.3 and 27.1 retention times of liquiritigenin 
and isoliquiritigenin, respectively, also match expectations based on the differences 

35 between retention times for flavonoid aglycones and their corresponding malonyl- 
glucose conjugates. From this and the above data, it is concluded that the unknown 
at 15.5 is the malonyl-glucose conjugate of liquiritigenin, and the unknowns at 21.3 
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and 21.8 are malonyl-glucose conjugates of isoliquiritigenin. Conjugation of 
isoliquiritigenin at two different positions probably accounts for the latter two peaks. 
Accumulation of these intermediates in the CRC seed suggests that the isoflavone 
synthase catalyzed reaction may be limiting (Figure 1), although increased capture 
5 of intermediates by enhanced activities of genes encoding enzymes involved in 
conjugation is also a possibility. 
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CLAIMS 

1 . A method of altering the isoflavonoid profile of an isoflavonoid-producing 
plant, said method comprising: 

5 (a) transforming a plant with (i) a first recombinant expression construct 

comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a recombinant expression construct comprising a 
promoter operably linked to an isolated nucleic acid fragment encoding an R 
myc-type transcription factor, (ii) a second recombinant expression construct 

10 comprising a promoter operably linked to an isolated nucleic acid fragment encoding 
a C1 myb transcription factor and a promoter operably linked to an isolated nucleic 
acid fragment encoding an R myc-type transcription factor, or (iii) a recombinant 
expression construct comprising a promoter operably linked to an isolated nucleic 
acid fragment encoding all or part of a C1 myb transcription factor and all or part of 

15 an R myc-type wherein said construct is capable of functioning as both a C1 myb 
transcription factor and an R myc-type transcription factor; and 

(b) growing the transformed plant under conditions that are suitable for the 
expression of the recombinant expression construct or constructs; 

wherein expression of the construct or constructs alters the isoflavonoid profile 

20 of the transformed plant by increasing the total daidzein to total genistein ratio 
compared to the total daidzein to total genistein ratio of a control. 

2. The method of Claim 1 wherein the plant is transformed with a 
recombinant expression construct comprising a promoter operably linked to an 
isolated nucleic acid fragment encoding a chimeric transcription factor comprising 

25 the maize R coding region situated between the C1 DNA binding domain and the C1 
activation domain. 

3. The method of Claim 1 or Claim 2 wherein the promoter is a seed-specific 
promoter. 

4. The method of Claim 1 or Claim 2 wherein the isoflavonoid-producing plant 
30 is selected from the group consisting of soybean, clover, mung bean, lentil, hairy 

vetch, alfalfa, lupine, sugar beet, and snow pea. 

5. An isoflavonoid-producing plant made by the method of Claim 1 or 2 
wherein said plant has an increased total daidzein to total genistein ratio compared 
to the total daidzein to total genistein ratio of a control. 

35 6. The isoflavonoid-producing plant of Claim 5 wherein said plant is selected 

from the group consisting of soybean, clover, mung bean, lentil, hairy vetch, alfalfa, 
lupine, sugar beet, and snow pea. 



42 



WO 02/101023 



PCT/US02/21107 



7. Seeds or plant parts of the plant of Claim 5. 

8. Seeds or plant parts of the plant of Claim 6. 

9. An isoflavonoid-containing product having an increased ratio of total 
daidzein to total genistein obtained from the seeds or plant parts of Claim 7. 

5 10. An isoflavonoid-containing product having an increased ratio of total 

daidzein to total genistein obtained from the seeds or plant parts of Claim 8. 

1 1 . The isoflavonoid-containing product of Claim 9 wherein the isoflavonoid 
product is selected from the group consisting of protein isolate, protein concentrate, 
meal, grits, full fat and defatted flours, textured proteins, textured flours, textured 

10 concentrates, and textured isolates. 

12. The product of Claim 10 wherein the isoflavonoid-containing product is 
selected from the group consisting of protein isolate, protein concentrate, meal, 
grits, full fat and defatted flours, textured proteins, textured flours, textured 
concentrates, and textured isolates. 

15 13. An extracted isoflavonoid-containing product having an increased ratio of 

total daidzein to total genistein wherein said product is extracted from the seeds or 
plant parts of Claim 7. 

14. An extracted isoflavonoid-containing product having an increased ratio of 
total daidzein to total genistein wherein said product is extracted from the seeds or 

20 plant parts of Claim 8. 

15. A food which has incorporated therein the product of Claim 9. 

16. A food which has incorporated therein the product of Claim 10. 

17. A beverage which has incorporated therein the product of Claim 9. 

18. A beverage which has incorporated therein the product of Claim 10. 

25 19. An isoflavonoid-containing soy protein product having an increased ratio of 

total daidzein to total genistein obtained from the seeds of Claim 8 wherein the 
seeds are soybean seeds. 

20. The product of Claim 19 wherein the isoflavonoid product is selected from 
the group consisting of protein isolate, protein concentrate, meal, grits, full fat and 

30 defatted flours, textured proteins, textured flours, textured concentrates, textured 
isolates, soymilk, tofu, fermented soy products, and whole bean soy products. 

21. An extracted isoflavonoid-containing soy protein product having an 
increased ratio of total daidzein to total genistein wherein said product is extracted 
from the seeds of Claim 8 wherein the seeds are soybean seeds. 

35 22. A food which has incorporated therein the product of Claim 19. 

23. A beverage which has incorporated therein the product of Claim 19. 



43 



WO 02/101023 



PCT/US02/21107 



24. A method of producing an isoflavonoid-containing product 
which comprises: 

(a) cracking the seeds of Claim 7 to remove the meats from the hulls; and 

(b) flaking the meats obtained in step (a) to obtain the desired flake 
5 thickness. 

25. A method of producing an isoflavonoid-containing product which 
comprises: 

(a) cracking the seeds of Claim 8 remove the meats from the hulls; and 

(b) flaking the meats obtained in step (a) to obtain the desired flake 

10 thickness. 

26. The method of Claim 25 wherein the seeds are soybean seeds. 

27. An isoflavonoid-producing plant comprising in its genome 

(i) a first recombinant expression construct comprising a promoter 
operably linked to an isolated nucleic acid fragment encoding a C1 myb transcription 

15 factor and a recombinant expression construct comprising a promoter operably 
linked to an isolated nucleic acid fragment encoding an R myc-type transcription 
factor, 

(ii) a second recombinant expression construct comprising a promoter 
operably linked to an isolated nucleic acid fragment encoding a C1 myb transcription 

20 factor and a promoter operably linked to an isolated nucleic acid fragment encoding 
an R myc-type transcription factor, or 

(iii) a recombinant expression construct comprising a promoter 
operably linked to an isolated nucleic acid fragment encoding all or part of a C1 myb 
transcription factor and all or part of an R myc-type wherein said construct is 

25 capable of functioning as both a C1 myb transcription factor and an R myc-type 
transcription factor; 

wherein said plant has an increased total daidzein to total genistein ratio when 
compared to the total daidzein to total genistein ratio of a control. 

28. The isoflavonoid-producing plant of claim 27 wherein the recombinant 
30 expression construct (iii) comprises a promoter operably linked to an isolated 

nucleic acid fragment encoding a chimeric transcription factor comprising the maize 
R coding region situated between the C1 DNA binding domain and the C1 activation 
domain. 

29. The isoflavonoid-producing plant of claim 27 or 28 wherein the promoter is 
35 a seed-specific promoter. 

30. The isoflavonoid-producing plant of Claim 27 or Claim 28 wherein the 
isoflavonoid-producing plant is selected from the group consisting of soybean, 
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clover, mung bean, lentil, hairy vetch, alfalfa, lupine, sugar beet, and snow pea. 

31. The isoflavonoid-producing plant of Claim 27 or 28 wherein said plant has 
an increased total daidzein to total genistein ratio compared to the total daidzein to 
total genistein ratio of a control. 
5 32. The isoflavonoid-producing plant of Claim 31 wherein said plant is selected 

from the group consisting of soybean, clover, mung bean, lentil, hairy vetch, alfalfa, 
lupine, sugar beet, and snow pea. 

33. Seeds or plant parts of the plant of Claim 31 

34. Seeds or plant parts of the plant of Claim 32. 

10 35. An isoflavonoid-containing product having an increased ratio of total 

daidzein to total genistein obtained from the seeds or plants parts of Claim 33. 

36. An isoflavonoid-containing product having an increased ratio of total 
daidzein to total genistein obtained from the seeds or plant parts of Claim 34. 

37. The product of Claim 35wherein the isoflavonoid product is selected from 
15 the group consisting of protein isolate, protein concentrate, meal, grits, full fat and 

defatted flours, textured proteins, textured flours, textured concentrates, and 
textured isolates. 

38. The product of Claim 36 wherein the isoflavonoid-containing product is 
selected from the group consisting of protein isolate, protein concentrate, meal, 

20 grits, full fat and defatted flours, textured proteins, textured flours, textured 
concentrates, and textured isolates. 

39. An extracted isoflavonoid-containing product having an increased ratio of 
total daidzein to total genistein wherein said product is extracted from the seeds or 
plant parts of Claim 33. 

25 40. An extracted isoflavonoid-containing product having an increased ratio of 

total daidzein to total genistein wherein said product is extracted from the seeds or 
plant parts of Claim 34. 

41 . A food which has incorporated therein the product of Claim 35. 

42. A food which has incorporated therein the product of Claim 36. 

30 43. A beverage which has incorporated therein the product of Claim 35 

44. A beverage which has incorporated therein the product of Claim 36. 

45. An isoflavonoid-containing soy protein product having an increased ratio of 
total daidzein to total genistein obtained from the seeds of Claim 34 wherein the 
seeds are soybean seeds. 

35 46. The product of Claim 44 wherein the isoflavonoid product is selected from 

the group consisting of protein isolate, protein concentrate, meal, grits, full fat and 
defatted flours, textured proteins, textured flours, textured concentrates, textured 
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isolates, soymilk, tofu, fermented soy products, and whole bean soy products. 

47. An extracted isoflavonoid-containing soy protein product having an 
increased ratio of total daidzein to total genistein wherein said product is extracted 
from the seeds of Claim 34 wherein the seeds are soybean seeds. 
5 48. A food which has incorporated therein the product of Claim 40. 

49. A beverage which has incorporated therein the product of Claim 40. 

50. A method of producing an isoflavonoid-containing product 
which comprises: 

(a) cracking the seeds of Claim 33 to remove the meats from the hulls; 

10 and 

(b) flaking the meats obtained in step (a) to obtain the desired flake 
thickness. 

51 . A method of producing an isoflavonoid-containing product which 
comprises: 

15 (a) cracking the seeds of Claim 34 remove the meats from the hulls; and 

(b) flaking the meats obtained in step (a) to obtain the desired flake 
thickness. 

52. The method of Claim 50 wherein the seeds are soybean seeds. 
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SEQUENCE LISTING 

<110> E. I. du Pont de Nemours 

<I20> A METHOD FOR ALTERING THE ISOFLAVONOID PROFILE IN THE PLANT PARTS OF 
AN ISOFLAVONOID-PRODUCING PLANT 

<130> BB1452 PCT 

<140> 
<141> 

<150> 60/297,981 
<151> 2001-06-13 

<160> 16 

<170> Microsoft Office 97 

<210> 1 
<211> 22 
<212> DNA 

<213> Artif icial/Oligonucleotide primer 
<400> 1 

aggcggaaga actgctgcaa eg 22 



<210> 2 
211> 22 
<212> DNA 

<213> Artif icial/Oligonucleotide primer 
<400> 2 

aggtccattt egtegcagag gc 22 



<210> 3 
<211> 24 
<212> DNA 

<213> Artificial/PCR primer 
<400> 3 

atgtttggca agtaggaagg gacc 24 



<210> 4 
<211> 24 
<212> DNA 

<213> Artificial/PCR primer 
<400> 4 

gcattccata agccgtcacg attc 24 



<210> 5 
<211> 1919 
<212> DNA 
<213> Glycine max 
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<400> 5 

gcacgagacc ccacacagca accagagcag ctatgctagt gagaatcaac acactcctcc 60 

aaggctactc aggaatcagg tttgaaattt tggaggcaat cacaaagctt ctgaacaaca 12 0 

acattacccc atgtttgcca cttaggggaa caatcacagc atctggtgat cttgttcctt 18 0 

tgtcctacat tgctggtttg ctaactggta gaccaaactc caaggctgtt ggaccctctg 240 

gtgagattct gaatgccaaa gaagcctttg aattggccaa catcggtgct gagttctttg 300 

agttgcaacc taaggaaggc cttgcccttg tgaatggcac tgctgttggt tctggcttgg 360 

cctcaattgt tctatttgaa gccaacatca ttgctgtctt gtctgaagtt atttcagcaa 42 0 

tttttgctga agttatgcaa ggaaagcctg aattcactga ccatttgact cataaactaa 480 

agcaccaccc tggtcagatt gaagctgctg ctattatgga acacatbttg gaaggaagct 540 

cttacgtgaa agctgctaag aagttgcatg agattgatcc tttacaaaag cctaaacagg 600 

accgttatgc tcttaggact tcaccacaat ggcttggtcc tctaattgaa gtgattagat 660 

tctctaccaa gtcaattgag agggagatta actcagtcaa tgacaaccct ttgattgatg 720 

tgtcaaggaa caaggcactt catggtggta acttccaagg aactcctaht ggagtctcca 780 

tggataatac acgtttggct cttgcttcaa ttggtaaact catgtttgct caattctctg 840 

agcttgtcaa tgattattac aacaatggtt tgccttcaaa tctcactgcc agcagaaacc 90 0 

ccagcttgga ttatggattc aagggagctg aaattgccat ggcatcttat tgttctgaac 960 

ttcaatattt ggcgaatccg gtgacaagcc acgtgcaaag cgcggagcaa cacaaccaag 1020 

atgtgaactc tctggggctg atttcatcaa ggaagactca tgaggctatt gagatcctca 108 0 

agctcatgtc ctccactttc ctggtcgccc tttgccaagc cattgacttg aggcatttgg 1140 

aggagaattt gaagaacacg gtcaagaacg ttgtgagtca agttgctaag aggactctca 1200 

ccacaggtgt caatggagag cttcaccctt caaggttttg tgagaaggac ttgctcaagg 1260 

ttgttgatag ggagtacaca tttgcataca ttgatgaccc ctgcagtgga acataccctt 132 0 

tgatgcaaaa gctaaggcaa gtgcttgtgg actatgcatt ggccaatgga gagaacgaga 138 0 

agaacacaaa cacatcaatc ttccaaaaga ttgcaacatt tgaggaagag ttgaagaccc 1440 

ttttgcctaa ggaagtggaa ggtgcaagag ttgcatatga gaatgaccaa tgtgcaattc 1500 

caaacaagat caaggaatgc aggtcttacc ccttgtacaa gtttgtgaga gaggagttgg 1560 

ggacagcatt gctaactggt gaaagggtta tctcaccggg tgaagagtgt gacaaagtgt 162 0 

tcactgcttt gtgccaaggg aagatcattg atccactttt ggaatgcctt ggggagtgga 1680 

atggggcacc tcttccaata tgttagtttt tcttattttc tgttttcttg aagagtggtt 1740 

tcttttctgt acacgtgttt gtgttgatat taagcatttg gtttgtctat ataaggctgt 1800 

ggcaaatcaa tccacataca acaacttccc agttttcctt gatgtatgcc atgcaaggaa 1860 

cttgtaattc ataatgtaat agaattccat ttgtttgccg taaaaaaaaa aaaaaaaaa 1919 

<210> 6 

<211> 1812 

<212> DNA 

<213> Glycine max 

<400> 6 

ccaaactgaa ccaaatcatc acacacacat actaagaagc aacaattctt ctacaatgga 60 

tctcctcctt ctggaaaaga ccctcatagg tctcttcctc gctgcggtgg tcgccatcgc 12 0 

cgtctccacc ctccgcggcc ggaaattcaa gctcccaccg ggcccactcc ccgtcccaat 18 0 

cttcggcaac tggctccaag tcggcgacga cctcaaccac cgcaacctca ccgatttggc 24 0 

caaaaaattc ggtgacatct tcctcctccg catggggcag cgcaacctcg tcgfcggtttc 300 

ttcccctgag ctcgccaaag aggttctcca cacgcagggc gtggagttcg gctcccgcac 360 

ccgcaacgtc gtcttcgaca tcttcaccgg aaagggccaa gacatggtct tcaccgtcta 42 0 

cggcgagcac tggcgcaaaa tgcgccgcat catgaccgtc cccttcttca ccaacaaggt 480 

tgtgcaacaa taccgccatg gatgggaatc ggaggctgcc gccgtcgtcg aggacgtcaa 54 0 

gaaaaacccc gacgccgccg tctccggcac cgtcatccgc cgccgccttc agctcatgat 600 

gtacaacaac atgtaccgca taatgttcga ccggaggttc gagagcgagg aggatcccat 660 

cttccagagg ctaagagcct tgaacggaga gaggagtcgc ttggcgcaga gctttgagta 72 0 

taactatggt gattttattc ccatcttgag acccttcttg aagggttact tgaagatttg 780 

caaggaggtg aaggagacga ggttgaagct tttcaaggat tacttcgtbg acgagaggaa 84 0 

gaagcttgga agcaccaaga gcaccaacaa caataatgaa cttaaatgcg ctattgacca 900 

cattttggat gcccagagaa aaggcgagat caacgaagac aacgtcctct acattgttga 960 

aaacatcaac gttgctgcaa ttgaaacaac tctatggtcg attgagtggg gcattgctga 1020 

gcttgtgaac cacccagaga tccagcaaaa gttaagggat gagattgaca gagttcttgg 1080 

agcagggcac caagtgactg agccagacat ccaaaagctc ccatacctcc aagcagtggt 114 0 
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caaggaaact cttcgtctta gaatggcaat ccctctcctt gtaccacaca tgaacctcca 12 00 

cgacgcaaag cttgggggct atgatatccc agctgagagc aagatcttgg tgaatgcatg 12 60 

gtggctggcc aacaaccctg cacactggaa gaagccagag gagttccggc ctgagaggtt 1320 

cttcgaggag gagtcgcttg ttgaagccaa tggcaatgac tttaggtacc ttccctttgg 13 80 

tgttggcaga agaagctgcc ctggaatcat tcttgcattg ccaattcttg gcatcacttt 1440 

gggacgtttg gtccaaaact ttgagctctt gcctccccct ggccagtcac agattgacac 1500 

tagtgagaaa ggagggcaat ttagcttgca catactcaag cattccacca ttgtggcaaa 1560 

gccaaggtca ttttagactc caccacatga tcatgatcat atgatcaatc cccttaattt 1620 

tttctttttt cttttccctt ttctttactc tgtattgtat caatgcttga aaatggggtt 1680 

gttccaaaaa tgtcatatat atttggcctg ctaatgggta ttgtaaatct ctgaacttga 1740 

agtgatgcac ggttttgaga tggttttcta ataaacctac acttttgtct ctaaaaaaaa 18 00 

aaaaaaaaaa aa 1812 

<210> 7 

<211> 828 

<212> DMA 

<213> Glycine max 



<400> 7 

gttaaataga aaagaggagt ttgagaatgg caacgatcag cgcggttcag gtggagttcc 60 

tggagtttcc agcggtggtt acttcaccag cctccggcaa gacctatttc ctcggcggcg 120 

caggggagag aggattgacg attgagggga agttcataaa gttcacaggc ataggagtat 180 

acttggagga taaggcggtg ccatcactcg ccgctaagtg gaagggtaaa acttcagagg 240 

agttagttca caccctccac ttctacaggg atatcatttc agggccgttt gaaaagctaa 3 00 

ttagagggtc gaagattctg ccattggctg gcgctgaata ctcaaagaag gtgatggaaa 3 60 

actgcgtggc acacatgaag tctgttggga cttacggtga tgctgaagcc gcagccattg 420 

aaaagtttgc tgaagccttc aagaatgtga attttgcacc tggtgcctct gttttctaca 4 80 

gacaatcacc tgatggaatc ttggggctta gtttctctga agatgcaaca ataccagaaa 540 

aggaggctgc agtgatagag aacaaggctg tatcagcggc ggtcttggag accatgattg 600 

gtgaacatgc tgtttcccct gacttaaaac gcagtttggc ttctcgattg cctgcggtat 660 

tgagccacgg cattatagtc tgagaaatga gaaggatcaa ctttaccttt ttcaaatatt 720 

cttgtttttc tcctttcttt cttgtcgctt gtcatgtatt tctactgttt tattaaataa 7 80 

taaaattgag ttctgttaga gttggtgaaa aaaaaaaaaa aaaaaaaa 82 8 

<210> 8 
<211> 1394 
<212> DNA 
<213> Glycine max 



<400> 8 

gttagaatgg ctgctgctat tgaaatcccc acaatagtgt ttccaaactc ctctgcccaa 60 

cagaggatgc cagtggttgg aatgggatct gcccctgact tcacatgcaa gaaagacaca 120 

aaggaggcta tcattgaggc cgtgaaacag ggttacagac acttcgacac tgctgctgct 180 

tatggctctg aacaggctct cggtgaagct ctcaaggaag ctatccatct tggcctcgtc 240 

tcccgccaag acctctttgt cacttccaag ctttgggtca ccgaaaatca tcctcatctt 3 00 

gtccttcctg ctttgcgcaa atcacttaaa actcttcaac tagagtactt ggacctgtat 3 60 

ctcatccact ggcccctgag ttctcagcca gggaagttct catttccaat tgaagtagaa 420 

gatctcttgc cttttgacgt gaagggtgtg tgggaatcca tggaagagtg ccagaaactt 480 

ggcctcacca aagccattgg agtcagcaac ttctctgtca agaagcttca gaatctgctc 540 

tctgttgcta ccatccgtcc cgtggtcgat caagtggaga tgaaccttgc atggcaacag 600 

aagaagctaa gagagttctg caaagaaaat gggataatag tgactgcgtb ctctccactg 660 

aggaaaggtg cgagcagggg cccaaatgaa gtgafcggaga atgatgtgct gaaggagatt 720 

gcagaggctc atgggaaatc catagcccag gtgagtctga gatggttgta cgaacaaggt 780 

gtgacatttg tgccaaagag ctacgataag gagaggatga accagaatct gcacatcttt 840 

gactgggcat tgactgaaca agatcatcac aaaataagtc aaatcagcca gagccgtttg 900 

atcagcggac ccaccaaacc acaactcgct gatctctggg atgatcaaat ataaactatt 960 

tactactatg cagctcccac tctattttta taatccatct ttttacctct tgttcatttg 1020 

tgccaaagag ctacgataag gagaggatga accagaatct gcacatcttt gactgggcat 1080 

tgactgaaca agatcatccc aaaataagtc aaatcagcca gagccgtttg atcagcggac 1140 

cccccaaacc acaactcgct gatctctggg atgatcaaat ataaactatt tactactatg 1200 
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tctattttta 


taatccatct 


ttttacctct 


tgtttcattt 


tacgtttaaa 




taattcatgc 




cttafctttag 


atttcacaat 


caataaacta 


ggcacgcgcg 


1320 


gcacatgata 


tgaataaact 


atgttcaatt 


tttttttcaa 


aaaaaaaaaa 


aaaaaaaaaa 


1380 


aaaaaaaaaa 


aaaa 










1394 


<210> 9 














<211> 1756 














<212> DNA 














<213> Glycine max 












<400> 9 














gtaattaacc 


tcactcaaac 


tcgggatcac 


agaaaccaac 


aacagttctt 


gcactgaggt 


60 


ttcacgatgt 


tgctggaact 


tgcacttggt 


ttgtttgtgt 


tagctttgtt 


tctgcacttg 


12 0 


cgtcccacac 


caagtgcaaa 


atcaaaagca 


cttcgccacc 


tcccaaaccc 


tccaagccca 


180 


aagcctcgtc 


ttcccttcat 


fcggccacctfc 


cacctcttaa 






240 


gcactcatcg 


atctctccaa 


aaagcatggc 


cccttattct 


ctcfcctcctt 


cggctccatg 


300 


ccaaccgtcg 


ttgcctccac 


ccctgagttg 


ttcaagctct 


tcctccaaac 


ccacgaggca 


360 


acttccttca 


acacaaggtt 


ccaaacctct 


gccataagac 


gcctcactta 


cgacaactct 


4^0 


gtggccatgg 


ttccattcgg 


accttactgg 


aagttcgtga 


ggaagctcat 


catgaacgac 


480 


cttctcaacg 


ccaccaccgt 


caacaagctc 


aggccfctfcga 


ggacccaaca 


gatccgcaag 


540 


ttccttaggg 


ttatggccca 


aagcgcagag 


gcccagaagc 


cccttgacgt 


caccgaggag 


600 


cttctcaaat 


ggaccaacag 


caccatctcc 


atgatgatgc 




tgaggagatc 


660 


agagacatcg 


ctcgcgaggt 


tcttaagatc 


ttcggcgaat 


acaqcctcac 




720 


tggcctttga 


agtatctcaa 


ggttggaaag 




att 


tt 


780 


aagttcgacc 


ctgtcgttga 


aagggtcatc 


aagaagcgcc 


gtgagatcgt 


cagaaggaga 


840 


aagaacggag 


aagfcfcgfcfcga 


gggcgaggc c 


agcggcgtct 


fcccfccgacac 


tttgcttgaa 


900 


ttcgctgagg 


acgagaccat 


ggagatcaaa 


afcfcaccaagg 


agcaaatcaa 


gggccttgtt 


960 


gtcgactttt 


tctctgcagg 


gacagattcc 




caacagagtg 


ggcattggca 


1020 


gagctcatca 


acaatcccag 


ggtgttgcaa 


aaggctcgtg 


aggaggtcta 




1080 


ggcaaagata 


gactcgttga 


cgaagtfcgac 


actcaaaacc 




tagggccatt 


1140 


gtgaaggaga 


cattccgaat 


gcacccacca 


ctcccagfcgg 


tcaaaagaaa 


gtgcacagaa 


1200 


gagtgtgaga 


ttaatgggta 


tgtgatccca 


cfacfaoaacafc 


tggttctttt 


caatgtttgg 


1260 


caagtaggaa 


gggaccccaa 


stactgggac 


agaccatcag 


aattccgtcc 




1320 


ttagaaactg 


gtgctgaagg 


aaaaacaaaa 


cctcttgatc 




gcatttccaa 


1380 


ctcctcccat 


ttgggtctgg 


gaggagaatg 


tgccctggtg 


tcaatttggc 


tacttcagga 


1440 


atggcaacac 


ttcttgcatc 


tcttatccaa 


tgctttgacc 


tgcaagtgct 


gggccctcaa 


1500 


ggacaaatat 


tgaaaggtga 


tgatgccaaa 


gbtagcatgg 


aagagagagc 


tggcctcaca 


1560 


gttccaaggg 


cacatagtct 




ccacttgcaa 


ggatcggcgt 


tgcatctaaa 




ctcctttctt 


aattaagata 


atcatcatat 


acaatagtag 


tgtcttgcca 


tcgcagttgc 


1680 


tttttatgta 


ttcataatca 


tcatttcaat 


aaggtgtgac 


tggtacttaa 


tcaagtaatt 


1740 


aaggttacat 


acatgc 










1756 


<210> 10 














<211> 1465 














<212> DNA 














<213> Glycine max 












<400> 10 














gcacgagggc 


acgaggaagc 


attgcattct 


gctatttaat 


tccactac t 


acac cacat 


60 


tctcctcaaa 


gacaacaatg 


gcaccaacag 


ccaagactct 


qacttacctq 


accc 9 ^ 10 * 1 


120 


aaaccctaga 


atcgagcttc 


gttcgggacg 


aggaggagcg 


tccc aacrtt 


„„ rt . a ^ 9a9a 


180 


aattcagcga 


cgagatccca 


gtgatttctc 


ttgccggaa t 


cgacgaggcg 


at 


240 


gaagagagat 


ttgtgagaag 


atcgtggagg 


cttgcgagaa 


ttggggtata 


ttccaggttg 


300 


ttgatcacgg 


tgtggatcaa 


caactcgtgg 


ccgagatgac 


ccgtctcgcc 


aaagagttct 


360 


ttgctttgcc 


accggacgag 


aagcttcgtt 


ttgatatgtc 


cggcgccaaa 


aagggtggat 


420 


tcattgtctc 


cagccatctc 


caaggggaat 


cggtgcagga 


ctggagagaa 


atagtgacat 


480 


acttttcgta 


cccaaaaaga 


gagagggact 


attcaaggtg 


gccagacacg 


ccagaagggt 


540 


ggagatcggt 


gactgaggaa 


tacagcgaca 


aagtaatggg 


tctagcttgc 


aagctcatgg 


600 


aggtgttgtc 


cgaagcaatg 


gggttagaga 


aagagggttt 


aagcaaagca 


tgtgttgaca 


660 


tggaccagaa 


ggtggtggtt 


aattactacc 


ccaaatgccc 


tcaacctgac 


ctcactcttg 


720 
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gcctgaagcg 


ccacacggat 


ccgggcacta 


tcaccttgct 


gcttcaggac 


caagtgggtg 


780 


gacttcaagc 


caccagggac 


aatggcaaaa 


catggatcac 


cgttcagcct 


gtggaggctg 


840 


ccttcgtcgt 


caatcttgga 


gatcatgctc 


attatctgag 


caatggaagg 


ttcaagaatg 




ctgatcacca 


agcggtggtg 


aactcaaacc 


atagccgttt 


gtccatagcc 


acttttcaaa 


960 


acccagcacc 


aaatgcaact 


gtttaccctc 


tgaagataag 


agaaggagag 


aagcctgtga 


1020 


tggaggaacc 


aatcactttt 


gctgaaatgt 


acaggaggaa 


gatgagcaag 


gacattgaga 




ttgcaaggat 


gaagaagctg 


gctaaggaaa 


agcatttgca 


ggaccttgag 


aatgaaaagc 


114 0 


atttgcaaga 


acttgatcag 


aaggcaaaac 


ttgaggccaa 


gcctttgaag 


gagattcttg 


1200 


cttaattaat 


aataatfcaca 


tatgtatcat 


~ t9 ^t 9 t CC 




tttagtattt 


i^9n 


tttaagggcc 


atgaattaat 






ttttgtacgt 


cttatgattt 




atcctttgtg 


gggafc at c at 


gtgttgtgtt 


cagttgccta 


tgtcttatta 




1380 


tcatctatgt 


ataccttata 


tgtgcctcta 


ttataaatga 


aaataagtgg 


cactgtcttt 


1440 


attaaaaaaa 


aaaaaaaaaa 


aaaaa 








1465 


<210> 11 














<211> 1279 














<212> DNA 














<213> Glycine max 












<400> 11 














ataattcaac 


tgttttgggg 


tgtgattaaa 


agagaagcta 


gctaaaaaaa 


atgggttcag 


60 


catccgaaag 


tgtttgcgtt 


acaggagctt 


ctggtttcat 


cgggtcatgg 


cttgtcatga 


12 0 


gactcatcga 


gcgtggctac 


accgttcgag 


ccaccgtacg 


cgacccagta 


aacatgaaga 


180 


aggtgaagca 


tttggtggaa 


ctaccaggcg 


caaagagcaa 


actgtctctg 


tggaaggctg 


240 


atcttgctga 


agagggaagc 


tttgatgaag 


ccattaaagg 


ctgcaccgga 


gttttccacg 


300 


tggccacccc 


catggacttt 


gaatccaaag 


accctgagaa 


tgaagtgata 


aagcctacaa 


360 


taaatggggt 


actagacatc 


atgaaagcat 


gcttgaaggc 


aaaaactgtg 


cgaaggctaa 


420 


tattcacgtc 


ctcagccgga 


accctcaacg 


ttattgagcg 


ccaaaagccc 


gttttcgacg 




acacatgctg 


gagtgacgtt 


gagttttgcc 


gtagagttaa 


gatgactggt 


tggatgtatt 


540 


ttgtttctaa 


aacactggcg 


gagaaagaag 


catggaaatt 


tgccaaagag 


cagggcctgg 


600 


acttcatcac 


tatcattcca 


cctcttgttg 


tcggtccctt 


tctgafcgcca 


accatgccac 


660 


ctagcctaat 


cacggctcta 


tcgccaatca 


caggaaatga 


ggaccattac 


tcgatcataa 


720 


agcaaggtca 


attcgtccac 


ttagatgatc 


tctgtcttgc 


tcacatattt 


ctgtttgagg 




aaccagaagt 


ggaagggagg 


tacatatgca 


gtgcatgtga 


cgctaccatt 


catgacattg 


840 


ccaaattaat 


taaccaaaaa 


taccctgagt 


acaaggtccc 


caccaagttc 


aagaatattc 


900 


cagatcaatt 


ggagcttgtg 


agattttctt 


ccaagaagat 


cacagacttg 


ggattcaaat 


960 


ttaaatacag 


cttagaggac 


atgtacactg 


gagcaattga 


cacatgcaga 


gacaaagggc 


1020 


ttcttccgaa 


acctgcagaa 




ttactaaacc 


tggagaaact 


ccagtgaatg 


1080 


ccatgcataa 


ataggcattc 


atatctttgt 


atctgtgtga 


ggc g gca 


ac gctttt 




cttattccgt 


tgagtggctt 


ttcttgatta 




tttatgaaaa 


attagaaatg 


1200 


tgagtggctt 


gtaaggccag 


gttatcttca 


ataagttaat 


aaaaaccatc 


ttctaaagtc 


1260 


taaaaaaaaa 


aaaaaaaaa 










1279 


<210> 12 














<211> 1234 














<212> DMA 














<213> Glycine max 












<400> 12 














gcacgagatt 


ttatttttct 


ttctttcttt 


ggaagataaa 


gaatgggttc 


taagtccgaa 


60 


accgtttgcg 


ttactggggc 


ttctggttac 


atcggatcat 


ggcttgtcat 


gagactcatc 


120 


gagcgtggct 


ataccgttcg 


agccaccgta 


ctcgacccag 


ctgatatgag 


crcraGcrtcfaacr 




catttgctgg 


atctgccagg 


tgcagagagc 


aagctgtctc 


tgtggaaggc 


agaacttaca 


240 


gaagagggaa 


gctttgatga 


agccattaaa 


gggtgcacag 


gtgttttcca 


cttggccacc 


300 


cccgttgact 


ttaagtccaa 


agacccagag 


aatgaaatga 


taaagcctac 


aattcaagga 


360 


gtactaaaca 


tcatgaaagc 


atgcctgaag 


gcaaaaactg 


tccgaaggct 


agtattcacg 


420 


tcctcagccg 


gaactaccaa 


cattactgag 


caccaaaagc 


ctatcattga 


cgaaacctgc 


480 


tggactgatg 


ttgagttctg 


ccggagatta 


aatatgactg 


gttggatgta 


tttcgtttct 


540 


aaaacacttg 


cggagaaaga 


agcttggaaa 


tttgcgaaag 


agcacggcat 


ggacttcatc 


600 


gctatccttc 


cagctcttgt 


cattggtccc 


tttctactgc 


caacaatgcc 


ttctagcgtg 


660 
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atcagtgctc tttcacctat taacggaatt gaggcacatt attcaatcat aaagcaagct 72 0 

caattcgtcc acatagaaga tatctgtctt gctcacatat ttctgtttga acagccaaaa 780 

gcagaaggga ggtatatatg cagtgcatgt gacgttacta tccatgacat tgtaaaatta 840 

attaacgaaa aatacccaga gtacaaggtt cccaccaagt ttcagaacat tccagatcaa 900 

ttggagcccg tgagattttc ttccaagaaa atcacagact tgggattcca atttaaatac 960 

agcttagagg atatgtacac tggagcaatt gatacatgca tagagaaagg gcttcttcct 1020 

aaacctgcag aaattccagc gaatggcatc gagcataaat aaatataggt tttcatatct 10 80 

ttgcctcggt gatggctatg aatgttgctt ttcttgttca gtttctttaa tgatgtttcc 1140 

gttttgtgaa ttcgtagtca aaattgtaag tggtttgtaa gaccaaatta gttatctaca 12 00 

aattgtttaa tattatcaca aaaaaaaaaa aaaa 1234 

<210> 13 

<211> 1345 

<212> DKTA 

<213> Glycine max 

<400> 13 

gcacgagaat caacacacac aaacacaaca acatatggag gtgctaaggg tgcaaaccat 60 

agcttccaaa tccaaagatg ctgccatccc agccatgttt gttagggcag agacagagca 12 0 

accaggcatc acaaccgttc aaggggtgaa ccttgaggtg ccaattattg attttagtga 180 

cccagatgaa gggaaagtgg tgcatgagat tttggaggca agtagggact ggggcatgtt 24 0 

ccaaattgtg aaccatgaca tacctagtga tgttataaga aagttgcaaa gtgttgggaa 300 

aatgttcttt gagttgccac aagaggaaaa agagttgatt gctaagcctg ctgggtctga 360 

ttctattgaa gggtatggca caaagcttca gaaagaggtg aatggcaaga aagggtgggt 42 0 

ggatcatttg ttccacattg tgtggcctcc ttcctccatc aactacagtt tctggcccca 480 

aaacccccct tcttacaggg aagttaatga ggaatattgc aagcacctaa gaggagtggt 54 0 

agacaaattg ttcaaaagta tgtcggtagg gttggggctt gaagagaatg agctaaagga 60 0 

gggtgcaaat gaagatgaca tgcattatct tttaaaaatc aattattacc caccttgtcc 660 

atgtcctgat ctggtcttgg gtgtgccacc acacacagac atgtcctacc tcacaattct 72 0 

ggtgcctaac gaggtgcagg gccttcaagc atgtagggat ggccattggt acgatgttaa 780 

gtatgtcccc aatgccctcg ttattcacat tggcgaccaa atggagatac tgagcaatgg 840 

aaaatataag gcagtttttc acagaacaac agtgaacaaa gatgagacaa gaatgtcgtg 900 

gcccgtgttc atagaaccca aaaaggaaca agaagttggt cctcacccaa agttggttaa 960 

ccaagacaat ccaccaaaat acaaaaccaa gaaatacaag gattatgctt attgtaagct 102 0 

caataagatc cctcaatgaa tgaagtgggc actacgagat aattatagct catggtttct 108 0 

ctgttttgtt atttaattta tggaatgaaa gtgtgacttg tgggaagtag attaaataag 114 0 

atctgtgact aatgttggtt tctctgtttt gttatttaat ttatggaatg aaagtgtgac 120 0 

ttgtgggaag tagattaaat aagatctgtg actaatgttg gattaatctt atgcttttga 12 60 

agtaaataaa gatgacaaga aacagtttgt ctgttctgtt aaaaaaaaaa aaaaaaaaaa 132 0 

aaaaaaaaaa aaaaaaaaaa aaaaa 1345 

<210> 14 

<211> 911 

<212> DNA 

<213> Glycine max 

<400> 14 

agaccaagtc aagatcgttg cagcaataaa agaagctgga aacgtcaaga gatttttccc 60 

atctgaattt gggctggatg tggaccgtca cgatgcggct gagcctgtaa gagaagtttt 120 

cgaggaaaaa gcgaaaattc gaagagtaat tgaagctgaa ggaattcctt acacttacct 180 

atgttgtcat gcctttactg gttatttctt acgtaacctg gcacagattg acatcactgt 240 

tcctcctagg gacaaggbgt tcatacaagg agatggaaat gtgaaaggag cgtatattac 3 00 

tgaggcbgat gtgggaactt ttaccatcga agcagcaaat gaccctagag ccttgaacaa 3 60 

agccgtgcac ataagactcc caaacaatta tttgtcctta aatgatatca tctctttgtg 420 

ggagaaaaaa attgggaaaa ctcttgagaa aatttatgtt tcagaagaag aagttcttaa 4 80 

gcaaattaag gagacttctt tcctaaataa ttatcttctg gcactatacc actcacagca 540 

gataaaggga gatgcagtgt atgagattga ccctgccaaa gaccttgagg cttctgaggc 600 

ttatcctcac gtggaataca gcactgtttc tgaatatttg gatcagtttg tctgatttcc 660 

gaaatcttag gaaccaaagc acttatatga ttttatcact ctgcaaaatg cttaataaaa 720 

caaatgcagt tcttccttct gtttttcttt cagaaaaggc ctatgcaggc ttttttgctg 7 80 

6 



WO 02/101023 



PCT/US02/21107 



cactaccatt gtctgtttcg tagtggttgc ttgtgtttgt cctgttctca agaaagtatc 840 
cacatttaat cagtttaggt aagtgttcta cctgaaaaaa taataatata atcctgtttt 900 
aatattgtta c 911 

<210> 15 
<211> 21 
<212> DNA 

<213> Artificial/PCR primer 
<400> 15 

ttgctggaac ttgcacttgg t 21 



<210> 16 
<211> 32 
<212> DNA 

<213> Artificial/PCR primer 
<400> 16 

gtatatgatg ggtaccttaa ttaagaaagg ag 



32 
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Prepared using 


PCT-EASY Version 2.92 
(updated 01.01.2002) 


0-2 


International Application No. 




0-3 


Applicant's or agent's file reference 


BB1452PCT 




1 

1-1 
1-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to 
in the description on: 

page 
line 


6 

12 


1-3 

1-3-1 
1-3-2 

1-3-3 
1-3-4 


Identification of Deposit 

Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

29 July 1999 (29.07.1999) 
ATCC PTA371 


1-4 


Additional Indications 


NONE 


1-5 


Designated States for Which 
Indications are Made 


all designated States 


1-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


FOR RECEIVING OFFICE USE ONLY 
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This form was received with the 
international application: 

(yes or no) 




0-4-1 


Authorized officer 


fczzr 
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0-5 


This form was received by the 
international Bureau on: 




0-5-1 


Authorized officer 





